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1. Introduction 


This document describes the VP8 compressed video data format, 
together with a discussion of the decoding procedure for the format. 
It is intended to be used in conjunction with, and as a guide to, the 
reference decoder source code provided in Attachment One 

(Section 20). If there are any conflicts between this narrative and 
the reference source code, the reference source code should be 
considered correct. The bitstream is defined by the reference source 
code and not this narrative. 


Like many modern video compression schemes, VP8 is based on 
decomposition of frames into square subblocks of pixels, prediction 
of such subblocks using previously constructed blocks, and adjustment 
of such predictions (as well as synthesis of unpredicted blocks) 
using a discrete cosine transform (hereafter abbreviated as DCT). In 
one special case, however, VP8 uses a Walsh-Hadamard transform 
(hereafter abbreviated as WHT) instead of a DCT. 


Roughly speaking, such systems reduce datarate by exploiting the 
temporal and spatial coherence of most video signals. It is more 
efficient to specify the location of a visually similar portion of a 
prior frame than it is to specify pixel values. The frequency 
segregation provided by the DCT and WHT facilitates the exploitation 
of both spatial coherence in the original signal and the tolerance of 
the human visual system to moderate losses of fidelity in the 
reconstituted signal. 


VP8 augments these basic concepts with, among other things, 


sophisticated usage of contextual probabilities. The result is a 
Significant reduction in datarate at a given quality. 
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Unlike some similar schemes (the older MPEG formats, for example), 
VP8 specifies exact values for reconstructed pixels. Specifically, 
the specification for the DCT and WHT portions of the reconstruction 
does not allow for any "drift" caused by truncation of fractions. 
Rather, the algorithm is specified using fixed-precision integer 
operations exclusively. This greatly facilitates the verification of 
the correctness of a decoder implementation and also avoids 
difficult-to-predict visual incongruities between such 
implementations. 


It should be remarked that, in a complete video playback system, the 
displayed frames may or may not be identical to the reconstructed 
frames. Many systems apply a final level of filtering (commonly 
referred to as postprocessing) to the reconstructed frames prior to 
viewing. Such postprocessing has no effect on the decoding and 
reconstruction of subsequent frames (which are predicted using the 
completely specified reconstructed frames) and is beyond the scope of 
this document. In practice, the nature and extent of this sort of 
postprocessing is dependent on both the taste of the user and on the 
computational facilities of the playback environment. 


The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", 
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this 
document are to be interpreted as described in RFC 2119 [RFC2119]. 


2. Format Overview 


VP8 works exclusively with an 8-bit YUV 4:2:0 image format. In this 
format, each 8-bit pixel in the two chroma planes (U and V) 
corresponds positionally to a 2x2 block of 8-bit luma pixels in the 
Y plane; coordinates of the upper left corner of the Y block are of 
course exactly twice the coordinates of the corresponding chroma 
pixels. When we refer to pixels or pixel distances without 
specifying a plane, we are implicitly referring to the Y plane or to 
the complete image, both of which have the same (full) resolution. 


As is usually the case, the pixels are simply a large array of bytes 
Stored in rows from top to bottom, each row being stored from left to 
right. This "left to right" then "top to bottom" raster-scan order 
is reflected in the layout of the compressed data as well. 


Provision has been made in the VP8 bitstream header for the support 
of a secondary YUV color format, in the form of a reserved bit. 


Occasionally, at very low datarates, a compression system may decide 
to reduce the resolution of the input signal to facilitate efficient 
compression. The VP8 data format supports this via optional 

upscaling of its internal reconstruction buffer prior to output (this 
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is completely distinct from the optional postprocessing discussed 
earlier, which has nothing to do with decoding per se). This 
upsampling restores the video frames to their original resolution. 

In other words, the compression/decompression system can be viewed as 
a "black box", where the input and output are always at a given 
resolution. The compressor might decide to "cheat" and process the 
Signal at a lower resolution. In that case, the decompressor needs 
the ability to restore the signal to its original resolution. 


Internally, VP8 decomposes each output frame into an array of 
macroblocks. A macroblock is a square array of pixels whose Y 
dimensions are 16x16 and whose U and V dimensions are 8x8. 
Macroblock-level data in a compressed frame occurs (and must be 
processed) in a raster order similar to that of the pixels comprising 
the frame. 


Macroblocks are further decomposed into 4x4 subblocks. Every 
macroblock has 16 Y subblocks, 4 U subblocks, and 4 V subblocks. Any 
subblock-level data (and processing of such data) again occurs in 
raster order, this time in raster order within the containing 
macroblock. 


As discussed in further detail below, data can be specified at the 
levels of both macroblocks and their subblocks. 


Pixels are always treated, at a minimum, at the level of subblocks, 
which may be thought of as the "atoms" of the VP8 algorithm. In 
particular, the 2x2 chroma blocks corresponding to 4x4 Y subblocks 
are never treated explicitly in the data format or in the algorithm 
Specification. 


The DCT and WHT always operate at a 4x4 resolution. The DCT is used 
for the 16Y, 4U, and 4V subblocks. The WHT is used (with some but 
not all prediction modes) to encode a 4x4 array comprising the 
average intensities of the 16 Y subblocks of a macroblock. These 
average intensities are, up to a constant normalization factor, 
nothing more than the Oth DCT coefficients of the Y subblocks. This 
"higher-level" WHT is a substitute for the explicit specification of 
those coefficients, in exactly the same way as the DCT of a subblock 
substitutes for the specification of the pixel values comprising the 
subblock. We consider this 4x4 array as a second-order subblock 
called Y2, and think of a macroblock as containing 24 "real" 
subblocks and, sometimes, a 25th "virtual" subblock. This is dealt 
with further in Section 13. 


The frame layout used by the reference decoder may be found in the 
file vpx image.h (Section 20.23). 
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32 


Compressed Frame Types 
There are only two types of frames in VP8. 


Intraframes (also called key frames and, in MPEG terminology, 
I-frames) are decoded without reference to any other frame in a 
sequence; that is, the decompressor reconstructs such frames 
beginning from its "default" state. Key frames provide random access 
(or seeking) points in a video stream. 


Interframes (also called prediction frames and, in MPEG terminology, 
P-frames) are encoded with reference to prior frames, specifically 
all prior frames up to and including the most recent key frame. 
Generally speaking, the correct decoding of an interframe depends on 
the correct decoding of the most recent key frame and all ensuing 
frames. Consequently, the decoding algorithm is not tolerant of 
dropped frames: In an environment in which frames may be dropped or 
corrupted, correct decoding will not be possible until a key frame is 
correctly received. 


In contrast to MPEG, there is no use of bidirectional prediction. No 
frame is predicted using frames temporally subsequent to it; there is 
no analog to an MPEG B-frame. 


Secondly, VP8 augments these notions with that of alternate 
prediction frames, called golden frames and altref frames 
(alternative reference frames). Blocks in an interframe may be 
predicted using blocks in the immediately previous frame as well as 
the most recent golden frame or altref frame. Every key frame is 
automatically golden and altref, and any interframe may optionally 
replace the most recent golden or altref frame. 


Golden frames and altref frames may also be used to partially 
overcome the intolerance to dropped frames discussed above: If a 
compressor is configured to code golden frames only with reference to 
the prior golden frame (and key frame), then the "substream" of key 
and golden frames may be decoded regardless of loss of other 
interframes.  Roughly speaking, the implementation requires (on the 
compressor side) that golden frames subsume and recode any context 
updates effected by the intervening interframes. A typical 
application of this approach is video conferencing, in which 
retransmission of a prior golden frame and/or a delay in playback 
until receipt of the next golden frame is preferable to a larger 
retransmit and/or delay until the next key frame. 
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4. 


Overview of Compressed Data Format 


The input to a VP8 decoder is a sequence of compressed frames whose 
order matches their order in time. Issues such as the duration of 
frames, the corresponding audio, and synchronization are generally 
provided by the playback environment and are irrelevant to the 
decoding process itself; however, to aid in fast seeking, a start 
code is included in the header of each key frame. 


The decoder is simply presented with a sequence of compressed frames 
and produces a sequence of decompressed (reconstructed) YUV frames 
corresponding to the input sequence. As stated in the Introduction, 
the exact pixel values in the reconstructed frame are part of VP8's 
Specification. This document specifies the layout of the compressed 
frames and gives unambiguous algorithms for the correct production of 
reconstructed frames. 


The first frame presented to the decompressor is of course a key 


frame. This may be followed by any number of interframes; the 
correct reconstruction of each frame depends on all prior frames up 
to the key frame. The next key frame restarts this process: The 


decompressor resets to its default initial condition upon reception 
of a key frame, and the decoding of a key frame (and its ensuing 
interframes) is completely independent of any prior decoding. 


At the highest level, every compressed frame has three or more 


pieces. It begins with an uncompressed data chunk comprising 

10 bytes in the case of key frames and 3 bytes for interframes. This 
is followed by two or more blocks of compressed data (called 
partitions). These compressed data partitions begin and end on byte 
boundaries. 


The first compressed partition has two subsections: 
1. Header information that applies to the frame as a whole. 


2.  Per-macroblock information specifying how each macroblock is 
predicted from the already-reconstructed data that is available 
to the decompressor. 


As stated above, the macroblock-level information occurs in raster- 
Scan order. 


The rest of the partitions contain, for each block, the DCT/WHT 
coefficients (quantized and logically compressed) of the residue 
signal to be added to the predicted block values. It typically 
accounts for roughly 70% of the overall datarate. VP8 supports 
packing the compressed DCT/WHT coefficients' data from macroblock 
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rows into separate partitions. If there is more than one partition 
for these coefficients, the sizes of the partitions -- except the 
last partition -- in bytes are also present in the bitstream right 
after the above first partition. Each of the sizes is a 3-byte data 
item written in little endian format. These sizes provide the 
decoder direct access to all DCT/WHT coefficient partitions, which 
enables parallel processing of the coefficients in a decoder. 


The separate partitioning of the prediction data and coefficient data 
also allows flexibility in the implementation of a decompressor: An 
implementation may decode and store the prediction information for 
the whole frame and then decode, transform, and add the residue 
Signal to the entire frame, or it may simultaneously decode both 
partitions, calculating prediction information and adding in the 
residue signal for each block in order. The length field in the 
frame tag, which allows decoding of the second partition to begin 
before the first partition has been completely decoded, is necessary 
for the second "block-at-a-time" decoder implementation. 


All partitions are decoded using separate instances of the boolean 
entropy decoder described in Section 7. Although some of the data 
represented within the partitions is conceptually "flat" (a bit is 
just a bit with no probabilistic expectation one way or the other), 
because of the way such coders work, there is never a direct 
correspondence between a "conceptual bit" and an actual physical bit 
in the compressed data partitions. Only in the 3- or 10-byte 
uncompressed chunk described above is there such a physical 
correspondence. 


A related matter is that seeking within a partition is not supported. 
The data must be decompressed and processed (or at least stored) in 
the order in which it occurs in the partition. 


While this document specifies the ordering of the partition data 
correctly, the details and semantics of this data are discussed in a 
more logical fashion to facilitate comprehension. For example, the 
frame header contains updates to many probability tables used in 
decoding per-macroblock data. The per-macroblock data is often 
described before the layouts of the probabilities and their updates, 
even though this is the opposite of their order in the bitstream. 


5. Overview of the Decoding Process 


A VP8 decoder needs to maintain four YUV frame buffers whose 
resolutions are at least equal to that of the encoded image. These 
buffers hold the current frame being reconstructed, the immediately 
previous reconstructed frame, the most recent golden frame, and the 
most recent altref frame. 
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Most implementations will wish to "pad" these buffers with 
"invisible" pixels that extend a moderate number of pixels beyond all 


four edges of the visible image. This simplifies interframe 
prediction by allowing all (or most) prediction blocks -- which are 
not guaranteed to lie within the visible area of a prior frame -- to 


address usable image data. 


Regardless of the amount of padding chosen, the invisible rows above 
(or below) the image are filled with copies of the top (or bottom) 
row of the image; the invisible columns to the left (or right) of the 
image are filled with copies of the leftmost (or rightmost) visible 
row; and the four invisible corners are filled with copies of the 
corresponding visible corner pixels. The use of these prediction 
buffers (and suggested sizes for the halo) will be elaborated on in 
the discussion of motion vectors, interframe prediction, and 
sub-pixel interpolation later in this document. 


As will be seen in the description of the frame header, the image 
dimensions are specified (and can change) with every key frame. 
These buffers (and any other data structures whose size depends on 
the size of the image) should be allocated (or re-allocated) 
immediately after the dimensions are decoded. 


Leaving most of the details for later elaboration, the following is 
an outline of the decoding process. 


First, the frame header (the beginning of the first data partition) 
is decoded. Altering or augmenting the maintained state of the 
decoder, this provides the context in which the per-macroblock data 
can be interpreted. 


The macroblock data occurs (and must be processed) in raster-scan 
order. This data comes in two or more parts. The first (prediction 
or mode) part comes in the remainder of the first data partition. 
The other parts comprise the data partition(s) for the DCT/WHT 
coefficients of the residue signal. For each macroblock, the 
prediction data must be processed before the residue. 


Each macroblock is predicted using one (and only one) of four 
possible frames. All macroblocks in a key frame, and all intra-coded 
macroblocks in an interframe, are predicted using the already-decoded 


macroblocks in the current frame.  Macroblocks in an interframe may 
also be predicted using the previous frame, the golden frame, or the 
altref frame. Such macroblocks are said to be inter-coded. 
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The purpose of prediction is to use already-constructed image data to 
approximate the portion of the original image being reconstructed. 
The effect of any of the prediction modes is then to write a 
macroblock-sized prediction buffer containing this approximation. 


Regardless of the prediction method, the residue DCT signal is 
decoded, dequantized, reverse-transformed, and added to the 
prediction buffer to produce the (almost final) reconstruction value 
of the macroblock, which is stored in the correct position of the 
current frame buffer. 


The residue signal consists of 24 (sixteen Y, four U, and four V) 4x4 
quantized and losslessly compressed DCT transforms approximating the 
difference between the original macroblock in the uncompressed source 
and the prediction buffer. For most prediction modes, the Oth 
coefficients of the sixteen Y subblocks are expressed via a 25th WHT 
of the second-order virtual Y2 subblock discussed above. 


Intra-prediction exploits the spatial coherence of frames. The 16x16 
luma (Y) and 8x8 chroma (UV) components are predicted independently 
of each other using one of four simple means of pixel propagation, 
starting from the already-reconstructed (16-pixel-long luma, 8-pixel- 
long chroma) row above, and column to the left of, the current 
macroblock. The four methods are: 


1. Copying the row from above throughout the prediction buffer. 


2. Copying the column from the left throughout the prediction 
buffer. 


3. Copying the average value of the row and column throughout the 
prediction buffer. 


4. Extrapolation from the row and column using the (fixed) second 
difference (horizontal and vertical) from the upper left corner. 


Additionally, the sixteen Y subblocks may be predicted independently 
of each other using one of ten different modes, four of which are 4x4 
analogs of those described above, augmented with six "diagonal" 
prediction methods. There are two types of predictions, one intra 
and one prediction (among all the modes), for which the residue 
signal does not use the Y2 block to encode the DC portion of the 
Sixteen 4x4 Y subblock DCTs. This "independent Y subblock" mode has 
no effect on the 8x8 chroma prediction. 
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Inter-prediction exploits the temporal coherence between nearby 
frames. Except for the choice of the prediction frame itself, there 
is no difference between inter-prediction based on the previous frame 
and that based on the golden frame or altref frame. 


Inter-prediction is conceptually very simple. While, for reasons of 
efficiency, there are several methods of encoding the relationship 
between the current macroblock and corresponding sections of the 
prediction frame, ultimately each of the sixteen Y subblocks is 
related to a 4x4 subblock of the prediction frame, whose position in 
that frame differs from the current subblock position by a (usually 
small) displacement. These two-dimensional displacements are called 
motion vectors. 


The motion vectors used by VP8 have quarter-pixel precision. 
Prediction of a subblock using a motion vector that happens to have 
integer (whole number) components is very easy: The 4x4 block of 
pixels from the displaced block in the previous, golden, or altref 
frame is simply copied into the correct position of the current 
macroblock's prediction buffer. 


Fractional displacements are conceptually and implementationally more 
complex. They require the inference (or synthesis) of sample values 
that, strictly speaking, do not exist. This is one of the most basic 
problems in signal processing, and readers conversant with that 
subject will see that the approach taken by VP8 provides a good 
balance of robustness, accuracy, and efficiency. 


Leaving the details for the implementation discussion below, the 
pixel interpolation is calculated by applying a kernel filter (using 
reasonable-precision integer math) three pixels on either side, both 
horizontally and vertically, of the pixel to be synthesized. The 
resulting 4x4 block of synthetic pixels is then copied into position 
exactly as in the case of integer displacements. 


Each of the eight chroma subblocks is handled similarly. Their 
motion vectors are never specified explicitly; instead, the motion 
vector for each chroma subblock is calculated by averaging the 
vectors of the four Y subblocks that occupy the same area of the 
frame. Since chroma pixels have twice the diameter (and four times 
the area) of luma pixels, the calculated chroma motion vectors have 
1/8-pixel resolution, but the procedure for copying or generating 
pixels for each subblock is essentially identical to that done in the 
luma plane. 
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After all the macroblocks have been generated (predicted and 
corrected with the DCT/WHT residue), a filtering step (the loop 
filter) is applied to the entire frame. The purpose of the loop 
filter is to reduce blocking artifacts at the boundaries between 
macroblocks and between subblocks of the macroblocks. The term "loop 
filter" is used because this filter is part of the "coding loop"; 
that is, it affects the reconstructed frame buffers that are used to 
predict ensuing frames. This is distinguished from the 
postprocessing filters discussed earlier, which affect only the 
viewed video and do not "feed into" subsequent frames. 


Next, if signaled in the data, the current frame may replace the 
golden frame prediction buffer and/or the altref frame buffer. 


The halos of the frame buffers are next filled as specified above. 
Finally, at least as far as decoding is concerned, the (references 
to) the "current" and "last" frame buffers should be exchanged in 
preparation for the next frame. 


Various processes may be required (or desired) before viewing the 
generated frame. As discussed in the frame dimension information 
below, truncation and/or upscaling of the frame may be required. 
Some playback systems may require a different frame format (RGB, 
YUY2, etc.). Finally, as mentioned in the Introduction, further 
postprocessing or filtering of the image prior to viewing may be 
desired. Since the primary purpose of this document is a decoding 
Specification, the postprocessing is not specified in this document. 


While the basic ideas of prediction and correction used by VP8 are 
straightforward, many of the details are quite complex. The 
management of probabilities is particularly elaborate. Not only do 
the various modes of intra-prediction and motion vector specification 
have associated probabilities, but they, together with the coding of 
DCT coefficients and motion vectors, often base these probabilities 
on a variety of contextual information (calculated from what has been 
decoded so far), as well as on explicit modification via the frame 
header. 


The "top-level" of decoding and frame reconstruction is implemented 
in the reference decoder file dixie.c (Section 20.4). 


This concludes our summary of decoding and reconstruction; we 
continue by discussing the individual aspects in more depth. 


A reasonable "divide and conquer" approach to implementation of a 
decoder is to begin by decoding streams composed exclusively of key 
frames. After that works reliably, interframe handling can be added 
more easily than if complete functionality were attempted 
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immediately. In accordance with this, we first discuss components 
needed to decode key frames (most of which are also used in the 
decoding of interframes) and conclude with topics exclusive to 
interframes. 


6. Description of Algorithms 


As the intent of this document, together with the reference decoder 
Source code, is to specify a platform-independent procedure for the 
decoding and reconstruction of a VP8 video stream, many (small) 
algorithms must be described exactly. 


Due to its near-universality, terseness, ability to easily describe 
calculation at specific precisions, and the fact that On2's reference 
VP8 decoder is written in C, these algorithm fragments are written 
using the C programming language, augmented with a few simple 
definitions below. 


The standard (and best) reference for C is [Kernighan]. 


Many code fragments will be presented in this document. Some will be 
nearly identical to corresponding sections of the reference decoder; 
others will differ.  Roughly speaking, there are three reasons for 
such differences: 


1. For reasons of efficiency, the reference decoder version may be 
less obvious. 


2. The reference decoder often uses large data structures to 
maintain context that need not be described or used here. 


3. The authors of this document felt that a different expression of 
the same algorithm might facilitate exposition. 


Regardless of the chosen presentation, the calculation effected by 
any of the algorithms described here is identical to that effected by 


the corresponding portion of the reference decoder. 


All VP8 decoding algorithms use integer math. To facilitate 
Specification of arithmetic precision, we define the following types. 


sene Begin -code block: =====3====3=====2=2==2=3 SSS 0c 


typedef signed char int8; /* signed int exactly 8 bits wide */ 


typedef unsigned char uint8; /* unsigned "" */ 
typedef short intl6; /* signed int exactly 16 bits wide */ 
typedef unsigned intl6 uintl6; /* unsigned "" */ 
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/* int32 is a signed integer type at least 32 bits wide */ 


typedef long int32; /* guaranteed to work on all systems */ 
typedef int  int32; /* will be more efficient on some systems */ 


typedef unsigned int32 uint32; 


/* unsigned integer type, at least 16 bits wide, whose exact size 
is most convenient to whatever processor we are using */ 


typedef unsigned int uint; 


/* While pixels themselves are 8-bit unsigned integers, 
pixel arithmetic often occurs at 16- or 32-bit precision and 
the results need to be "saturated" or clamped to an 8-bit 
range. */ 


typedef uint8 Pixel; 
Pixel clamp255 (int32 v) ( return v < 0? 0 : (v < 255? v : 255);) 


/ As is elaborated in the discussion of the bool decoder below, 
VP8 represents probabilities as unsigned 8-bit numbers. %/ 


typedef uint8 Prob; 
== =>="Bnd. code block '=====8 oro oo mM 222 sees oni yo sodo P on sI??t5 


We occasionally need to discuss mathematical functions involving 
honest-to-goodness "infinite precision" real numbers. The DCT is 
first described via the cosine function cos; the ratio of the lengths 
of the circumference and diameter of a circle is denoted pi; at one 
point, we take a (base 1/2) logarithm, denoted log; and pow(x, y) 
denotes x raised to the power y. If x = 2 and y is a small 
non-negative integer, pow(2, y) may be expressed in C as 1 << y. 


Finally, we sometimes need to divide signed integers by powers of 
two; that is, we occasionally right-shift signed numbers. The 
behavior of such shifts (i.e., the propagation of the sign bit) is, 
perhaps surprisingly, not defined by the C language itself and is 
left up to individual compilers. Because of the utility of this 
freguently needed operation, it is at least arguable that it should 
be defined by the language (to naturally propagate the sign bit) and, 
at a minimum, should be correctly implemented by any reasonable 
compiler. In the interest of strict portability, we attempt to call 
attention to these shifts when they arise. 
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7. 


Boolean Entropy Decoder 


As discussed in the overview above, essentially the entire VP8 data 
Stream is encoded using a boolean entropy coder. 


An understanding of the bool decoder is critical to the 
implementation of a VP8 decompressor, so we discuss the bool decoder 
in detail. It is easier to comprehend the bool decoder in 
conjunction with the bool encoder used by the compressor to write the 
compressed data partitions. 


The bool encoder encodes (and the bool decoder decodes) one bool 
(zero-or-one boolean value) at a time. Its purpose is to losslessly 
compress a seguence of bools for which the probability of their being 
zero or one can be well-estimated (via constant or previously coded 
information) at the time they are written, using identical 
corresponding probabilities at the time they are read. 


As the reader is probably aware, if a bool is much more likely to be 
zero than one (for instance), it can, on average, be faithfully 
encoded using much less than one bit per value. The bool encoder 
exploits this. 


In the 1940s, [Shannon] proved that there is a lower bound for the 
average datarate of a faithful encoding of a seguence of bools (whose 
probability distributions are known and are independent of each 
other) and also that there are encoding algorithms that approximate 
this lower bound as closely as one wishes. 


If we encode a seguence of bools whose probability of being zero is p 
(and whose probability of being 1 is 1-p), the lowest possible 
datarate per value is 


plog(p) + (1-p)log(1-p); 


taking the logarithms to the base 1/2 expresses the datarate in bits/ 
value. 


We give two simple examples. At one extreme, if p = 1/2, then log(p) 
= log(l-p) = 1, and the lowest possible datarate per bool is 1/2 + 
1/2 = 1; that is, we cannot do any better than simply literally 
writing out bits. At another extreme, if p is very small, say p = 
1/1024, then log(p)=10, log(1-p) is roughly .0014, and the lowest 
possible datarate is approximately 10/1024 + .0014, roughly 1/100 of 
a bit per bool. 


Bankoski, et al. Informational [Page 16] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


Because most of the bools in the VP8 datastream have zero- 
probabilities nowhere near 1/2, the compression provided by the 
bool encoder is critical to the performance of VP8. 


The boolean coder used by VP8 is a variant of an arithmetic coder. 
An excellent discussion of arithmetic coding (and other lossless 


compression technigues) can be found in [Bell]. 


7.1. Underlying Theory of Coding 


The basic idea used by the boolean coder is to consider the entire 
data stream (either of the partitions in our case) as the binary 
expansion of a single number x with 0 << x < 1. The bits (or bytes) 
in x are of course written from high to low order, and if b[j] (BI[jl) 
is the j^(th) bit (byte) in the partition, the value x is simply the 
sum (starting with j = 1) of pow(2, -j) * b[j] or pow(256, -j) * 
B[j]- 


Before the first bool is coded, all values of x are possible. 


The coding of each bool restricts the possible values of x in 
proportion to the probability of what is coded. If pl is the 
probability of the first bool being zero and a zero is coded, the 
range of possible values of x is restricted to 0 <= x « pl. If a one 
is coded, the range becomes pl <= x < 1. 


The coding continues by repeating the same idea. At every stage, 


there is an interval a << x « b of possible values of x. If p is the 
probability of a zero being coded at this stage and a zero is coded, 
the interval becomes a << x < a+ (p(b-a)). If a one is coded, the 


possible values of x are restricted to a t (p(b-a)) << x < b. 


Assuming that only finitely many values are to be coded, after the 
encoder has received the last bool, it can write as its output any 
value x that lies in the final interval.  VP8 simply writes the left 
endpoint of the final interval.  Conseguently, the output it would 
make if encoding were to stop at any time either increases or stays 
the same as each bool is encoded. 


Decoding parallels encoding. The decoder is presented with the 
number x, which has only the initial restriction 0 << x < 1. To 
decode the first bool, the decoder is given the first probability pl. 
If x < pl, a zero is decoded; if x >= pl, a one is decoded. In 
either case, the new restriction on x -- that is, the interval of 
possible values of x -- is remembered. 


Bankoski, et al. Informational [Page 17] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


Decoding continues in exactly the same way: If a << x « b is the 
current interval and we are to decode a bool with zero-probability p, 
we return a zero if a << x < a+ (p(b-a)) and a one if a + (p(b-a)) 
<< x < b. In either case, the new restriction is remembered in 
preparation for decoding the next bool. 


The process outlined above uses real numbers of infinite precision to 
express the probabilities and ranges. It is true that, if one could 
actualize this process and coded a large number of bools whose 
supplied probabilities matched their value distributions, the 
datarate achieved would approach the theoretical minimum as the 
number of bools encoded increased. 


Unfortunately, computers operate at finite precision, and an 
approximation to the theoretically perfect process described above is 
necessary. Such approximation increases the datarate but, at guite 
moderate precision and for a wide variety of data sets, this increase 
is negligible. 


The only conceptual limitations are, first, that coder probabilities 
must be expressed at finite precision and, second, that the decoder 
be able to detect each individual modification to the value interval 
via examination of a fixed amount of input. As a practical matter, 
many of the implementation details stem from the fact that the coder 
can function using only a small "window" to incrementally read or 
write the arbitrarily precise number x. 


7.2. Practical Algorithm Description 


VP8's boolean coder works with 8-bit probabilities p. The range of 
such p is 0 << p << 255; the actual probability represented by p is 
p/256. Also, the coder is designed so that decoding of a bool 
requires no more than an 8-bit comparison, and so that the state of 
both the encoder and decoder can be easily represented using a small 
number of unsigned 16-bit integers. 


The details are most easily understood if we first describe the 
algorithm using bit-at-a-time input and output. Aside from the 
ability to maintain a position in this bitstream and write/read bits, 
the encoder also needs the ability to add 1 to the bits already 
output; after writing n bits, adding 1 to the existing output is the 
same thing as adding pow(2, -n) to x. 
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Together with the bit position, the encoder must maintain two 
unsigned 8-bit numbers, which we call "bottom" and "range". Writing 
w for the n bits already written and S = pow(2, - n - 8) for the 
scale of the current bit position one byte out, we have the following 
constraint on all future values v of w (including the final value 

v = X): 


w + (S * bottom ) << v < w t (S * ( bottom + range ) ) 


Thus, appending bottom to the already-written bits w gives the left 
endpoint of the interval of possible values, appending bottom + range 
gives the right endpoint, and range itself (scaled to the current 
output position) is the length of the interval. 


So that our probabilistic encodings are reasonably accurate, we do 
not let range vary by more than a factor of two: It stays within the 
bounds 128 <= range <= 255. 


The process for encoding a boolean value val whose probability of 
being zero is prob / 256 -- and whose probability of being one is 
( 256 - prob ) / 256 -- with 1 <= prob <= 255 is as follows. 


Using an unsigned 16-bit multiply followed by an unsigned right 
shift, we calculate an unsigned 8-bit split value: 


split = 1 + (((range - 1) * probability)]] >> 8) 


split is approximately ( prob / 256 ) * range and lies within the 
bounds 1 << split <= range - 1. These bounds ensure the correctness 
of the decoding procedure described below. 


If the incoming boolean val to be encoded is false, we leave the left 
interval endpoint bottom alone and reduce range, replacing it by 
split. If the incoming val is true, we move up the left endpoint to 
bottom t split, propagating any carry to the already-written value w 
(this is where we need the ability to add 1 to w), and reduce range 
to range - split. 


Regardless of the value encoded, range has been reduced and now has 
the bounds 1 «- range «- 254. If range « 128, the encoder doubles it 
and shifts the high-order bit out of bottom to the output as it also 
doubles bottom, repeating this process one bit at a time until 128 «- 
range <= 255. Once this is completed, the encoder is ready to accept 
another bool, maintaining the constraints described above. 


After encoding the last bool, the partition may be completed by 
appending bottom to the bitstream. 
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The decoder mimics the state of the encoder. It maintains, together 
with an input bit position, two unsigned 8-bit numbers, a range 
identical to that maintained by the encoder and a value.  Decoding 
one bool at a time, the decoder (in effect) tracks the same left 
interval endpoint as does the encoder and subtracts it from the 
remaining input.  Appending the unread portion of the bitstream to 
the 8-bit value gives the difference between the actual value encoded 
and the known left endpoint. 


The decoder is initialized by setting range - 255 and reading the 
first 16 input bits into value. The decoder maintains range and 
calculates split in exactly the same way as does the encoder. 


To decode a bool, it compares value to split; if value « split, the 


bool is zero, and range is replaced with split. If value >= split, 
the bool is one, range is replaced with range - split, and value is 
replaced with value - split. 


Again, range is doubled one bit at a time until it is at least 128. 
The value is doubled in parallel, shifting a new input bit into the 
bottom each time. 


Writing Value for value together with the unread input bits and Range 
for range extended indefinitely on the right by zeros, the condition 


Value « Range is maintained at all times by the decoder. In 
particular, the bits shifted out of value as it is doubled are always 
zero. 


7.3. Actual Implementation 


The C code below gives complete implementations of the encoder and 
decoder described above. While they are logically identical to the 
"pit-at-a-time" versions, they internally buffer a couple of extra 
bytes of the bitstream. This allows I/O to be done (more 

practically) a byte at a time and drastically reduces the number of 
carries the encoder has to propagate into the already-written data. 


Another (logically equivalent) implementation may be found in the 
reference decoder file bool decoder.h (Section 20.2). 
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==== Begin code. block ss-e5 roso 5-5 sss >5- o 929222 
/* Encoder first %/ 


typedef struct { 
uint8 *output; /* ptr to next byte to be written */ 
uint32 range; /* 128 <= range <= 255 */ 
uint32 bottom; /* minimum value of remaining output */ 
int bit_count; /* # of shifts before an output byte 
is available */ 
) bool encoder; 


/* Must set initial state of encoder before writing any bools. */ 


void init bool encoder(bool encoder *e, uint8 *start partition) 
{ 

e->output = start_partition; 

e->range = 255; 

e->bottom = 0; 

e->bit_count = 24; 


} 


/* Encoding very rarely produces a carry that must be propagated 
to the already-written output. The arithmetic guarantees that 
the propagation will never go beyond the beginning of the 
output. Put another way, the encoded value x is always less 
than one. */ 


void add_one_to_output (uint8 *q) 
{ 
while (*--q == 255) 
*q = 0; 
++*g; 
) 


/* Main function writes a bool_value whose probability of being 
zero is (expected to be) prob/256. */ 


void write bool(bool encoder *e, Prob prob, int bool value) 
( 
/* split is approximately (range * prob) / 256 and, 
crucially, is strictly bigger than zero and strictly 
smaller than range */ 


uint32 split < 1 1 (((e-»range - 1) * prob) >> 8); 
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if (bool value) 1 
e->bottom += split; /* move up bottom of interval */ 


e->range -= split; /* with corresponding decrease in range %/ 
) else 
e-»range - split; /* decrease range, leaving bottom alone */ 


while (e-»range « 128) 
{ 


e->range <<= 1; 


if (e->bottom & (1 << 31)) /* detect carry */ 
add one to output (e->output) ; 


e->bottom <<= 1; /* before shifting bottom */ 

if (!--e->bit count) ( /% write out high byte of bottom ... */ 
*e->output++ = (uint8) (e->bottom >> 24); 
e->bottom &= (1 << 24) - 1; /* ... keeping low 3 bytes */ 


e->bit_count = 8; /* 8 shifts until next output */ 


} 


/* Call this function (exactly once) after encoding the last 
bool value for the partition being written */ 


void flush bool encoder (bool encoder *e) 
{ 

int c = e->bit_count; 

uint32 v = e->bottom; 


if (v & (1 << (32 - c))) /* propagate (unlikely) carry */ 
add one to output (e-»output); 
v <<< c & 7; /* before shifting remaining output */ 
c >>= 3; /* to top of internal buffer */ 
while (--c >= 0) 
v <<= 8; 
c= 4; 
while (--c >= 0) { /* write remaining data, possibly padded */ 
*e->outputt+ = (uint8) (v >> 24); 
v <<= 8; 


} 
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/* Decoder state exactly parallels that of the encoder. 
"value", together with the remaining input, equals the 
complete encoded number x less the left endpoint of the 
current coding interval. */ 


typedef struct { 


uint8 *input; /* pointer to next compressed data byte */ 
uint32 range; /* always identical to encoder's range */ 

uint32 value; /* contains at least 8 significant bits */ 
int bit count;  /* 4 of bits shifted out of 


value, at most 7 */ 
) bool decoder; 


/* Call this function before reading any bools from the 
partition. */ 


void init bool decoder (bool decoder "d, uint8 "start partition) 


0; /* value - first 2 input bytes */ 
while (++i << 2) 


d->value < (d->value << 8) | *start partitiontt; 


d->input = start partition; /* ptr to next byte to be read */ 

d-»range - 255; /* initial range is full */ 

d->bit count = 0; /* have not yet shifted out any bits */ 
} 


/* Main function reads a bool encoded at probability prob/256, 
which of course must agree with the probability used when the 
bool was written. */ 


int read bool(bool decoder *d, Prob prob) 
{ 
/* range and split are identical to the corresponding values 
used by the encoder when this bool was written */ 


uint32 split < 1 1 (((d->range - 1) * prob) >> 8); 
uint32 SPLIT split << 8; 
int retval; /* will be 0 or 1 %/ 
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if (d->value >= SPLIT) { /* encoded a one */ 


retval < 1; 

d->range -- split;  /% reduce range */ 

d->value -< SPLIT;  /% subtract off left endpoint of interval %/ 
) else { /* encoded a zero */ 

retval < 0; 

d->range = split; /* reduce range, no change in left endpoint */ 


} 


while (d->range < 128) { /* shift out irrelevant value bits */ 
d->value <<= 1; 
d->range <<= 1; 


if (++d->bit_count == 8) ( /* shift in new bits 8 at a time */ 


d->bit count < 0; 
d->value |= *d->input++; 
} 
} 
return retval; 


} 


/* Convenience function reads a "literal", that is, a "num bits"- 
wide unsigned value whose bits come high- to low-order, with 
each bit encoded at probability 128 (i.e., 1/2). */ 


uint32 read literal(bool decoder *d, int num bits) 


{ 
uint32 v = 0; 


while (num bits--) 
v = (v << 1) + read bool(d, 128); 
return v; 


} 
/* Variant reads a signed number */ 


int32 read signed literal(bool decoder *d, int num bits) 
{ 
int32 v = 0; 
if (!num bits) 
return 0; 
if (read bool(d, 128)) 


v = -l; 
while (--num_bits) 
v = (v << 1) + read_bool (d, 128); 


return v; 


meam Bnd Code block —--——---92--99--80-2--epy-uescesc-eesccpo 
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8. Compressed Data Components 


At the lowest level, VP8's compressed data is simply a sequence of 
probabilistically encoded bools. Most of this data is composed of 
(slightly) larger semantic units fashioned from bools, which we 
describe here. 


We sometimes use these descriptions in C expressions within data 
format specifications. In this context, they refer to the return 
value of a call to an appropriate bool decoder d, reading (as always) 
from its current reference point. 


4-------------- 4------- $ + 
| Call | Alt | Return 
4-------------- 4------- $ + 
Bool (p) B (p) Bool with probability p/256 of being 0. 
Return value of read bool(d, p). 
Flag F A one-bit flag (same thing as a B(128) or 
an L(1)). Abbreviated F. Return value of 
read bool(d, 128). 
Lit (n) L(n) Unsigned n-bit number encoded as n flags 


(a "literal").  Abbreviated L(n). The 
bits are read from high to low order. 
Return value of read literal(d, n). 


SignedLit (n) Signed n-bit number encoded similarly to 


an L(n). Return value of 
read signed literal(d, n). These are 
rare. 

P(8) An 8-bit probability. No different from 
an L(8), but we sometimes use this 
notation to emphasize that a probability 
is being coded. 

P (7) A 7-bit specification of an 8-bit 
probability.  Coded as an L(7) number x; 
the resulting 8-bit probability is x ? x 
xx 4 

F? X A flag that, if true, is followed by a 


piece of data X. 
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| F? X:Y | | A flag that, if true, is followed by X 

| | | and, if false, is followed by Y. Also | 

| used to express a value where Y is an | 
implicit default (not encoded in the data 

| | | stream), as in F? P(8):255, which 

| | | expresses an optional probability: If the | 

| | | flag is true, the probability is specified | 

| | | as an 8-bit literal, while if the flag is | 

| | | false, the probability defaults to 255. | 

| B(p? X | B(p)? | Variants of the above using a boolean | 

| | X:Y | indicator whose probability is not 

| | | necessarily 128. | 

| T | | Tree-encoded value from small alphabet. | 

+-------------- +------- 4-------------------------------------------- + 

The last type requires elaboration. We often wish to encode 

something whose value is restricted to a small number of 

possibilities (the alphabet). 

This is done by representing the alphabet as the leaves of a small 

binary tree. The (non-leaf) nodes of the tree have associated 

probabilities p and correspond to calls to read_bool(d, p). We think 


of a zero as choosing the left branch below the node and a one as 
choosing the right branch. 


Thus, every value (leaf) whose tree depth is x is decoded after 
exactly x calls to read_bool. 


A tree representing an encoding of an alphabet of n possible values 
always contains n-1 non-leaf nodes, regardless of its shape (this is 
easily seen by induction on n). 


There are many ways that a given alphabet can be so represented. The 
choice of tree has little impact on datarate but does affect decoder 
performance. The trees used by VP8 are chosen to (on average) 
minimize the number of calls to read bool. This amounts to shaping 
the tree so that values that are more probable have smaller tree 
depth than do values that are less probable. 


Readers familiar with Huffman coding will notice that, given an 
alphabet together with probabilities for each value, the associated 
Huffman tree minimizes the expected number of calls to read bool. 
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Such readers will also realize that the coding method described here 
never results in higher datarates than does the Huffman method and, 
indeed, often results in much lower datarates.  Huffman coding is, in 
fact, nothing more than a special case of this method in which each 
node probability is fixed at 128 (i.e., 1/2). 


8.1. Tree Coding Implementation 


We give a suggested implementation of a tree data structure followed 
by a couple of actual examples of its usage by VP8. 


It is most convenient to represent the values using small positive 
integers, typically an enum counting up from zero. The largest 
alphabet (used to code DCT coefficients, described in Section 13) 
that is tree-coded by VP8 has only 12 values. The tree for this 
alphabet adds 11 interior nodes and so has a total of 23 positions. 
Thus, an 8-bit number easily accommodates both a tree position and a 
return value. 


A tree may then be compactly represented as an array of (pairs of) 
8-bit integers. Each (even) array index corresponds to an interior 
node of the tree; the Oth index of course corresponds to the root of 
the tree. The array entries come in pairs corresponding to the left 
(0) and right (1) branches of the subtree below the interior node. 
We use the convention that a positive (even) branch entry is the 
index of a deeper interior node, while a nonpositive entry v 
corresponds to a leaf whose value is -v. 


The node probabilities associated to a tree-coded value are stored in 
an array whose indices are half the indices of the corresponding tree 
positions. The length of the probability array is one less than the 

Size of the alphabet. 


Here is C code implementing the foregoing. The advantages of our 
data structure should be noted. Aside from the smallness of the 
structure itself, the tree-directed reading algorithm is essentially 
a single line of code. 
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===> Begin code, block ============>=+2===2=+ +32 742232 == 
/* A tree specification is simply an array of 8-bit integers. */ 


typedef int8 tree index; 
typedef const tree index Tree[]; 


/* Read and return a tree-coded value at the current decoder 
position. */ 


int treed_read ( 
bool_decoder * const d, /* bool_decoder always returns a 0 or 1 */ 


Tree t, /* tree specification */ 

const Prob p[] /* corresponding interior node probabilities */ 
) 4 

register tree_index i = 0; /* begin at root */ 


/* Descend tree until leaf is reached */ 
while ((i = t[ i + read bool(d, p[i>>1])]) > 0) {} 


return -i; /* return value is negation of nonpositive index */ 


} 
Sesh End code block === ===: omo smo oo mni! 


Tree-based decoding is implemented in the reference decoder file 
bool decoder.h (Section 20.2). 


8.2. Tree Coding Example 


As a multi-part example, without getting too far into the semantics 
of macroblock decoding (which is of course taken up below), we look 
at the "mode" coding for intra-predicted macroblocks. 


It so happens that, because of a difference in statistics, the Y (or 
luma) mode encoding uses two different trees: one for key frames and 


another for interframes. This is the only instance in VP8 of the 
same dataset being coded by different trees under different 
circumstances. The UV (or chroma) modes are a proper subset of the Y 


modes and, as such, have their own decoding tree. 
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set Begin code. block sze o 5-5 to -5- 55 2222 o obo seas 


typedef enum 
DC PRED, /* predict DC using row above and column to the left */ 
V PRED, /* predict rows using row above */ 
H PRED, /* predict columns using column to the left */ 
TM PRED, /* propagate second differences a la "True Motion" */ 


B PRED, /* each Y subblock is independently predicted */ 


num uv modes = B PRED, /* first four modes apply to chroma */ 
num ymodes /* all modes apply to luma */ 
} 


intra_mbmode; 


/* The aforementioned trees together with the implied codings as 
comments. 
Actual (i.e., positive) indices are always even. 
Value (i.e., nonpositive) indices are arbitrary. */ 


const tree index ymode tree [2 * (num ymodes - 1)] = 
{ 
-DC PRED, 2, /* root: DC PRED < "0", "1" subtree */ 
4, 6, /* "1" subtree has 2 descendant subtrees */ 
-V PRED, -H PRED, /* "10" subtree: V PRED - "100", 
H PRED < "101" */ 
-TM PRED, -B PRED /* "11" subtree: TM PRED < "110", 
B PRED < "111" */ 
y; 
const tree index kf ymode tree [2 * (num ymodes - 1)] = 
{ 
-B PRED, 2, /* root: B PRED < "0", "1" subtree */ 
4, 6, /* "1" subtree has 2 descendant subtrees */ 
-DC PRED, -V PRED, /* "10" subtree: DC PRED - "100", 
V PRED < "101" %/ 
—-H PRED, -TM PRED /* "11" subtree: H PRED < "110", 
TM PRED < "111" %/ 


y; 
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const tree index uv mode tree [2 * (num uv modes - 1)] - 
{ 
-DC PRED, 2, /* root: DC PRED < "0", "1" subtree */ 
-V PRED, 4, /* "1" subtree: V PRED = "10", 
"11" subtree */ 
-H PRED, -TM PRED /* "11" subtree: H PRED < "110", 
TM PRED < "111" */ 


y; 
/* Given a bool decoder d, a Y mode might be decoded as follows. */ 


const Prob pretend its huffman [num ymodes - 1] - 
( 128, 128, 128, 128); 


Ymode = (intra mbmode) treed read(d, ymode tree, 
pretend its huffman); 


ss-cuEndocode.Dblóóck =5++=======+=99H=Si === tRSs= 


Since it greatly facilitates re-use of reference code, and since 
there is no real reason to do otherwise, it is strongly suggested 
that any decoder implementation use exactly the same enumeration 
values and probability table layouts as those described in this 
document (and in the reference code) for all tree-coded data in VP8. 


Frame Header 


The uncompressed data chunk at the start of each frame and at the 
first part of the first data partition contains information 
pertaining to the frame as a whole. We list the fields in the order 
of occurrence. Most of the header decoding occurs in the reference 
decoder file dixie.c (Section 20.4). 


.1. Uncompressed Data Chunk 


The uncompressed data chunk comprises a common (for key frames and 
interframes) 3-byte frame tag that contains four fields, as follows: 


1. A 1-bit frame type (0 for key frames, 1 for interframes). 
2. A 3-bit version number (0 - 3 are defined as four different 


profiles with different decoding complexity; other values may be 
defined for future variants of the VP8 data format). 
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3. A 1-bit show frame flag (0 when current frame is not for display, 
1 when current frame is for display). 


4. A 19-bit field containing the size of the first data partition in 
bytes. 


The version number setting enables or disables certain features in 
the bitstream, as follows: 


4--------- AZ b-----------—— * 
| Version | Reconstruction Filter | Loop Filter | 
4--------- AO b-----------— * 
| o | Bicubic | Normal | 
| | | | 
| 1 | Bilinear | Simple | 
| 2 | Bilinear None 

| | | | 
| 3 | None | None | 
| | | | 
| Other | Reserved for future use | 

4--------- === == 9989 ==5 825352 $ + 


The reference software also adjusts the loop filter based on version 
number, as per the table above. Version number 1 implies a "simple" 
loop filter, and version numbers 2 and 3 imply no loop filter. 
However, the "simple" filter setting in this context has no effect 
whatsoever on the decoding process, and the "no loop filter" setting 
only forces the reference encoder to set filter level equal to O0. 
Neither affect the decoding process. In decoding, the only loop 
filter settings that matter are those in the frame header. 


For key frames, the frame tag is followed by a further 7 bytes of 
uncompressed data, as follows: 


+=2>"Beg91n code. block: -ss+ss ss Shans a ates Sse 22222225 


Start code byte 0 0x9d 
Start code byte 1 0x01 
Start code byte 2 0x2a 
16 bits : (2 bits Horizontal Scale «« 14) | Width (14 bits) 
16 bits : (2 bits Vertical Scale «« 14) | Height (14 bits) 


Hoh ENA (Code. DIOOk SS A a z RA 
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The following source code segment illustrates validation of the start 
code and reading the width, height, and scale factors for a key 
frame. 
==== Begin- code block. zet sot soo o s s-o-- 5 ------— 
unsigned char *c = pbi->sourcet3; 
// vet via sync code 
if (c[0]!<0x9d||c[1]!<0x01||c[2] !<0x2a) 
return -l; 
===> End eode block =========S=22 =P == 2 SH ss 
Where pbi->source points to the beginning of the frame. 


The following code reads the image dimension from the bitstream: 


=== Begin code block! Zreč ooo 2222922222237 


pc-»Width = swap2(*(unsigned short") (c*3)) &0x3fff; 
pc->horiz scale = swap2(* (unsigned short") (ct3))>>14; 
pc->Height = swap2 (" (unsigned short") (c*5)) &0x3fff; 
pc-»vert scale = swap2(" (unsigned short") (c+5))>>14; 


=== blid code block, === S o SH 22992258 +95 


Where the swap2 macro takes care of the endian on a different 
platform: 


s= Begin “code block. ---29--9-5----—------e---—.2----- o 


dif defined( ppc ) 

f define swap2(d) \ 
((d&0x000000££)««8) | \ 
((ds0x0000££00)>>8) 

felse 
4 define swap2(d) d 

fendif 


Sene ENA CdS block a -S aaa R Sa a Sa Ses 
While each frame is encoded as a raster scan of 16x16 macroblocks, 
the frame dimensions are not necessarily evenly divisible by 16. In 


this case, write ew = 16 - (width & 15) and eh = 16 - (height & 15) 
for the excess width and height, respectively. Although they are 
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encoded, the last ew columns and eh rows are not actually part of the 
image and should be discarded before final output. However, these 
"excess pixels" should be maintained in the internal reconstruction 
buffer used to predict ensuing frames. 


The scaling specifications for each dimension are encoded as follows. 


d-----—— AZ A E E + 
| Value | Scaling | 
Ho Fann E N E a + 
o) | No upscaling (the most common case). | 
| | | 
|1 | Upscale by 5/4. | 
| | | 
| 2 | Upscale by 5/3. 
3 Upscale by 2. 
Ho AZ O O AAA EAE + 


Upscaling does not affect the reconstruction buffer, which should be 
maintained at the encoded resolution. Any reasonable method of 
upsampling (including any that may be supported by video hardware in 
the playback environment) may be used. Since scaling has no effect 
on decoding, we do not discuss it any further. 


As discussed in Section 5, allocation (or re-allocation) of data 
structures (such as the reconstruction buffer) whose size depends on 


dimension will be triggered here. 


9.2. Color Space and Pixel Type (Key Frames Only) 


4------- 4------------------------------------------ 4 
| Field | Value | 
4------- 4------------------------------------------ t 
| L(1) | 1-bit color space type specification | 
| L(1) | 1-bit pixel value clamping specification | 
+------- 4p------------------------------------------ + 

The color space type bit is encoded as follows: 

o 0 - YUV color space similar to the YCrCb color space defined in 


[ITU-R BT.601] 


o 1 - Reserved for future use 
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The pixel value clamping type bit is encoded as follows: 


o 0 - Decoders are required to clamp the reconstructed pixel values 
to between 0 and 255 (inclusive). 


o 1 - Reconstructed pixel values are guaranteed to be between 0 and 
255; no clamping is necessary. 


Information in this subsection does not appear in interframes. 
9.3.  Segment-Based Adjustments 


This subsection contains probability and value information for 
implementing segment adaptive adjustments to default decoder 
behavior. The data in this subsection is used in the decoding of the 
ensuing per-segment information and applies to the entire frame. 

When segment adaptive adjustments are enabled, each macroblock will 
be assigned a segment ID.  Macroblocks with the same segment ID 
belong to the same segment and have the same adaptive adjustments 
over default baseline values for the frame. The adjustments can be 
guantizer level or loop filter strength. 


The context for decoding this feature at the macroblock level is 
provided by a subsection in the frame header, which contains: 


1. A segmentation enabled flag that enables the feature for this 
frame if set to 1, and disables it if set to 0. The following 
fields occur if the feature is enabled. 


2. L(1) indicates if the segment map is updated for the current 
frame (update mb segmentation map). 


3. L(1) indicates if the segment feature data items are updated for 
the current frame (update segment feature data). 


4. If Item 3 above (update segment feature data) is 1, the following 
fields occur: 


a. L(1), the mode of segment feature data 


(segment feature mode), can be absolute-value mode (0) or 
delta value mode (1). 
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b. Segment feature data items are decoded segment by segment for 
each segment feature. For every data item, a one-bit flag 
indicates whether the item is 0, or a non-zero value to be 


decoded. If the value is non-zero, then the value is decoded 
as a magnitude L(n), followed by a one-bit sign (L(1) -- 0 
for positive and 1 for negative). The length n can be looked 


up from a pre-defined length table for all feature data. 


5. If the L(1) flag as noted in Item 2 above is set to 1, the 
probabilities of the decoding tree for the segment map are 
decoded from the bitstream. Each probability is decoded with a 
one-bit flag indicating whether the probability is the default 
value of 255 (flag is set to 0), or an 8-bit value, L(8), from 
the bitstream. 


The layout and semantics supporting this feature at the macroblock 
level are described in Section 10. 


9.4. Loop Filter Type and Levels 


VP8 supports two types of loop filters having different computational 
complexity. The following bits occur in the header to support the 
selection of the baseline type, strength, and sharpness behavior of 
the loop filter used for the current frame. 


4+------- 4------------------- + 
| Index | Description | 
4+------- 4------------------- + 
| L(1) | filter type | 
| | | 
| L(6) | loop_filter_level | 
| | | 
| L(3) | sharpness level | 
4+------- 4------------------- + 


The meaning of these numbers will be further explained in Section 15. 


VP8 has a feature in the bitstream that enables adjustment of the 
loop filter level based on a macroblock’s prediction mode and 
reference frame. The per-macroblock adjustment is done through delta 
values against the default loop filter level for the current frame. 
This subsection contains flag and value information for implementing 
per-macroblock loop filter level adjustment to default decoder 
behavior. The data in this section is used in the decoding of the 
ensuing per-macroblock information and applies to the entire frame. 
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L(1) is a one-bit flag indicating if the macroblock loop filter 
adjustment is on for the current frame. 0 means that such a feature 
is not supported in the current frame, and 1 means this feature is 
enabled for the current frame. 


Whether the adjustment is based on a reference frame or encoding 
mode, the adjustment of the loop filter level is done via a delta 
value against a baseline loop filter value. The delta values are 
updated for the current frame if an L(1) bit, 

mode ref lf delta update, takes the value 1. There are two groups of 
delta values: One group of delta values is for reference frame-based 
adjustments, and the other group is for mode-based adjustments. The 
number of delta values in the two groups is MAX REF LF DELTAS and 
MAX MODE LF DELTAS, respectively. For every value within the two 
groups, there is a one-bit L(1) to indicate if the particular value 
is updated. When one is updated (1), it is transmitted as a six-bit- 
magnitude L(6) followed by a one-bit sign flag (L(1) -- 0 for 
positive and 1 for negative). 


9.5. Token Partition and Partition Data Offsets 


VP8 allows DCT coefficients to be packed into multiple partitions, 
besides the first partition with header and per-macroblock prediction 
information, so the decoder can perform parallel decoding in an 
efficient manner. A two-bit L(2) is used to indicate the number of 
coefficient data partitions within a compressed frame. The two bits 
are defined in the following table: 


4------- 4------- 4---------------------- * 
| Bit 1 | Bit 0 | Number of Partitions | 
4------- 4------- 4---------------------- * 
| 0 | 0 E! | 
| | | | 
eee ie 

edep 
lea | 1 | 8 | 
4------- 4------- 4---------------------- * 


Offsets are embedded in the bitstream to provide the decoder direct 


access to token partitions. If the number of data partitions is 
greater than 1, the size of each partition (except the last) is 
written in 3 bytes (24 bits). The size of the last partition is the 


remainder of the data not used by any of the previous partitions. 
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The partitioned data are consecutive in the bitstream, so the size 
can also be used to calculate the offset of each partition. The 
following pseudocode illustrates how the size/offset is defined by 
the three bytes in the bitstream. 


=== Begin code: DioR === +22 2997222929 ET 


Offset/size =  (uint32) (byte0) + ((uint32) (bytel)<<8) 
+ ((uint32) (byte2)<<16); 


e RS ÓN 
9.6. Dequantization Indices 


All residue signals are specified via a quantized 4x4 DCT applied to 
the Y, U, V, or Y2 subblocks of a macroblock. As detailed in 

Section 14, before inverting the transform, each decoded coefficient 
is multiplied by one of six dequantization factors, the choice of 
which depends on the plane (Y, chroma = U or V, Y2) and coefficient 
position (DC = coefficient 0, AC = coefficients 1-15). The six 
values are specified using 7-bit indices into six corresponding fixed 
tables (the tables are given in Section 14). 


The first 7-bit index gives the dequantization table index for 
Y-plane AC coefficients, called yac gi. It is always coded and acts 
as a baseline for the other 5 quantization indices, each of which is 
represented by a delta from this baseline index.  Pseudocode for 
reading the indices follows: 


===) Begin code block: čes TE mm 5 - xa ae esas 
yac gi = L(7); /* Y ac index always specified */ 


ydc delta = F? delta(): 0; /* Y dc delta specified if 
flag is true */ 


y2dc delta = F? delta(): 0; /* Y2 dc delta specified if 
flag is true */ 
y2ac delta = F? delta(): 0; /* Y2 ac delta specified if 


flag is true */ 


uvdc delta < F? delta(): 0; /* chroma dc delta specified 
if flag is true */ 
uvac delta < F? delta(): 0; /* chroma ac delta specified 


if flag is true */ 


ecco enc Code: block SSssS25=S- SSeS assess SSeS SSS Sse SSS = 
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Where delta() is the process to read 5 bits from the bitstream to 
determine a signed delta value: 


4------- HO O + 
| Index | Description | 
+-----—- HO + 
| L(4) | Magnitude of delta 

| | | 
| L(1) | Sign of delta, 0 for positive and 1 for negative | 
4------- HO + 


9.7. Refresh Golden Frame and Altref Frame 


For key frames, both the golden frame and the altref frame are 
refreshed/ replaced by the current reconstructed frame, by default. 
For non-key frames, VP8 uses two bits to indicate whether the two 
frame buffers are refreshed, using the reconstructed current frame: 


d------- A A T A A E r A A EA A Er a A T a T E + 

| Index | Description | 

d------- dulce MEET T men + 
L(1) Whether golden frame is refreshed (0 for no, 1 for yes). 

| L(1) | Whether altref frame is refreshed (0 for no, 1 for yes). | 

qe REGII Shoe SSS Ses SoSH Se URS SE SS Se eS SS IS A + 


When the flag for the golden frame is 0, VP8 uses 2 more bits in the 
bitstream to indicate whether the buffer (and which buffer) is copied 
to the golden frame, or if no buffer is copied: 


4+------- A + 
| Index | Description | 
4+------- HZ + 
| L(2) | Buffer copy flag for golden frame buffer | 
4+------- 4------------------------------------------ + 


Where: 

o 0 means no buffer is copied to the golden frame 

o 1 means last frame is copied to the golden frame 

o 2 means alt ref frame is copied to the golden frame 


Similarly, when the flag for altref is 0, VP8 uses 2 bits in the 
bitstream to indicate which buffer is copied to alt ref frame. 
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4+------- HZ + 
| Index | Description | 
4+------- A + 
| L(2) | Buffer copy flag for altref frame buffer | 
4+------- 4------------------------------------------ + 


Where: 

o 0 means no buffer is copied to the altref frame 

o 1 means last frame is copied to the altref frame 

o 2 means golden frame is copied to the altref frame 


Two bits are transmitted for ref frame sign bias for golden frame and 
alt ref frame, respectively. 


4+------- 4--------------------------------- + 
| Index | Description | 
4+------- 4--------------------------------- + 
| L(1) | Sign bias flag for golden frame | 
L(1) Sign bias flag for altref frame 
Ho 4--------------------------------- + 


These values are used to control the sign of the motion vectors when 
a golden frame or an altref frame is used as the reference frame for 
a macroblock. 


9.8. Refresh Last Frame Buffer 


VP8 uses one bit, L(1), to indicate if the last frame reference 
buffer is refreshed using the constructed current frame. Ona key 
frame, this bit is overridden, and the last frame buffer is always 
refreshed. 


9.9. DCT Coefficient Probability Update 


This field contains updates to the probability tables used to decode 
DCT coefficients. For each of the probabilities in the tables, there 
is an L(1) flag indicating if the probability is updated for the 
current frame, and if the L(1) flag is set to 1, there follows an 
additional 8-bit value representing the new probability value. These 
tables are maintained across interframes but are of course replaced 
with their defaults at the beginning of every key frame. 


The layout and semantics of this field will be taken up in 
Section 13. 
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9.10. Remaining Frame Header Data (Non-Key Frame) 


mb no skip coeff. This flag indicates at the frame level 
if skipping of macroblocks with no non-zero coefficients 
is enabled. If it is set to 0, then prob skip false is 
not read and mb skip coeff is forced to 0 for all 
macroblocks (see Sections 11.1 and 12.1). 


prob skip false = probability used for decoding a 
macroblock-level flag, which indicates if a macroblock 
has any non-zero coefficients. Only read if 

mb no skip coeff is 1. 


prob intra < probability that a macroblock is "intra" 
predicted (that is, predicted from the already-encoded 
portions of the current frame), as opposed to "inter" 
predicted (that is, predicted from the contents of a 
prior frame). 


+ 
| 
+ 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| prob_last = probability that an inter-predicted 
| macroblock is predicted from the immediately previous 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
+ 


altref frame. 


prob_gf = probability that an inter-predicted macroblock 
is predicted from the most recent golden frame, as 
opposed to the altref frame. 


If true, followed by four L(8)s updating the 
probabilities for the different types of intra-prediction 
for the Y plane. These probabilities correspond to the 
four interior nodes of the decoding tree for intra-Y 
modes in an interframe, that is, the even positions in 
the ymode_tree array given above. 


If true, followed by three L(8)s updating the 

probabilities for the different types of intra-prediction 
for the chroma planes. These probabilities correspond to 
the even positions in the uv_mode_tree array given above. 


Motion vector probability update. Details are given in 


| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
frame, as opposed to the most recent golden frame or | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
Section 17.2, "Probability Updates". 
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9. 


10. 


Decoding of this portion of the frame header is handled in the 
reference decoder file dixie.c (Section 20.4). 


11. Remaining Frame Header Data (Key Frame) 


mb no skip coeff. This flag indicates at the frame level 
if skipping of macroblocks with no non-zero coefficients 
is enabled. If it is set to 0, then prob skip false is 
not read and mb skip coeff is forced to 0 for all 
macroblocks (see Sections 11.1 and 12.1). 


has any non-zero coefficients. Only read if 


prob skip false - Probability used for decoding a 
mb no skip coeff is 1. 


| 
| 
| 
| 
| macroblock-level flag, which indicates if a macroblock 
| 
+ 


Decoding of this portion of the frame header is handled in the 
reference decoder file modemv.c (Section 20.11). 


This completes the layout of the frame header. The remainder of the 
first data partition consists of macroblock-level prediction data. 


After the frame header is processed, all probabilities needed to 
decode the prediction and residue data are known and will not change 
until the next frame. 


Segment-Based Feature Adjustments 


Every macroblock may optionally override some of the default 
behaviors of the decoder. Specifically, VP8 uses segment-based 
adjustments to support changing quantizer level and loop filter level 
for a macroblock. When the segment-based adjustment feature is 
enabled for a frame, each macroblock within the frame is coded with a 
segment id. This effectively segments all the macroblocks in the 
current frame into a number of different segments.  Macroblocks 
within the same segment behave exactly the same for quantizer and 
loop filter level adjustments. 


If both the segmentation enabled and update mb segmentation map flags 
in subsection B of the frame header take a value of 1, the prediction 
data for each (intra- or inter-coded) macroblock begins with a 
Specification of segment id for the current macroblock. It is 
decoded using this simple tree 
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===="Begin code, block zs-e ooo ooo o +22 +22 


const tree index mb segment tree [2 * (4-1)] < 


Qua x45 /* root: "0", "1" subtrees */ 
-0, -1, /* "00" < Oth value, "01" - 1st value */ 
A 3 /* "10" - 2nd value, "11" - 3rd value */ 


+25 End code block reče so Si HH esse sae ice?! 


combined with a 3-entry probability table, 
mb_segment_tree_probs[3]. The macroblock’s segment_id is used later 
in the decoding process to look into the segment_feature_data table 
and determine how the quantizer and loop filter levels are adjusted. 


The decoding of segment_id, together with the parsing of 


intra-prediction modes (which is taken up next), is implemented in 
the reference decoder file modemv.c. 


11. Key Frame Macroblock Prediction Records 
After specifying the features described above, the macroblock 
prediction record next specifies the prediction mode used for the 
macroblock. 

11.1. mb_skip_coeff 
The single bool flag is decoded using prob_skip_false if and only if 
mb no skip coeff is set to 1 (see Sections 9.10 and 9.11). If 
mb no skip coeff is set to 0, then this value defaults to 0. 

11.2. Luma Modes 
First comes the luma specification of type intra_mbmode, coded using 


the kf_ymode_tree, as described in Section 8 and repeated here for 
convenience: 


Bankoski, et al. Informational [Page 42] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


set Begin code. block sze o 5-5 to -5- 55 2222 o obo seas 


typedef enum 
DC PRED, /* predict DC using row above and column to the left */ 
V PRED, /* predict rows using row above */ 
H PRED, /* predict columns using column to the left */ 
TM PRED, /* propagate second differences a la "True Motion" */ 


B PRED, /* each Y subblock is independently predicted */ 


num uv modes = B PRED, /* first four modes apply to chroma */ 
num ymodes /* all modes apply to luma */ 


} 


intra_mbmode; 


const tree index kf_ymode_tree [2 * (num ymodes - 1)] = 
{ 
-B PRED, 2, /* root: B PRED < "0", "1" subtree */ 
4, 6, /* "1" subtree has 2 descendant subtrees */ 
-DC PRED, -V PRED, /* "10" subtree: DC PRED < "100", 
V PRED < "101" */ 
—-H PRED, -TM PRED /* "11" subtree: H PRED < "110", 
TM PRED < "111" %/ 


n 
===> End Cde block 'S==3=5=+=2 52992229 9 SH 295 


For key frames, the Y mode is decoded using a fixed probability array 
as follows: 


Spo (BEGIN Code DLOCK Store EI SO SA > [tt 


const Prob kf ymode prob [num ymodes - 1] = ( 145, 156, 163, 128); 
Ymode = (intra mbmode) treed read(d, kf ymode tree, kf ymode prob); 


==== End, code Dlock.--======9===+2=292= a 


d is of course the bool_decoder being used to read the first data 
partition. 


If the Ymode is B_PRED, it is followed by a (tree-coded) mode for 


each of the 16 Y subblocks. The 10 subblock modes and their coding 
tree are as follows: 
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set Begin code block sze os o 5-5 55m >>. o odo Os soo 


typedef enum 

B DC PRED, /* predict DC using row above and column 
to the left */ 

B TM PRED, /* propagate second differences a la 
"True Motion" */ 


B VE PRED, /* predict rows using row above */ 
B HE PRED, /* predict columns using column to the left */ 


B LD PRED, /* southwest (left and down) 45 degree diagonal 
prediction */ 
B RD PRED, /* southeast (right and down) "" */ 


B VR PRED, /* SSE (vertical right) diagonal prediction %/ 


( 
B VL PRED, /* SSW (vertical left) "" */ 
B HD PRED, /* ESE (horizontal down) "" */ 
B HU PRED, /* ENE (horizontal up) "" */ 


num intra bmodes 


} 


intra_bmode; 


/* Coding tree for the above, with implied codings as comments */ 


const tree index bmode tree [2 * (num intra bmodes - 1)] = 
{ 
-B DC PRED, 2, /* B DC PRED = "0" */ 
-B TM PRED, 4, /* B TM PRED < "10" */ 
-B VE PRED, 6, /* B VE PRED = "110" */ 
8, 12, 
-B HE PRED, 10, /* B HE PRED < "11100" */ 
-B RD PRED, -B VR PRED, /* B RD PRED < "111010", 
B VR PRED = "111011" */ 
-B LD PRED, 14, /* B LD PRED = "111110" */ 
-B VL PRED, 16, /* B VL PRED < "1111110" */ 
-B HD PRED, -B HU PRED ZE HAD: = TITO", 
HU < "11111111" %/ 


y 
==" nO ese block ¡== aS O eque 
The first four modes are smaller versions of the similarly named 


16x16 modes above, albeit with slightly different numbering. The 
last six "diagonal" modes are unique to luma subblocks. 
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11. 


Subblock Mode Contexts 


The coding of subblock modes in key frames uses the modes already 
coded for the subblocks to the left of and above the subblock to 
select a probability array for decoding the current subblock mode. 
This is our first instance of contextual prediction, and there are 
Several caveats associated with it: 


The adjacency relationships between subblocks are based on the 
normal default raster placement of the subblocks. 


The adjacent subblocks need not lie in the current macroblock. 
The subblocks to the left of the left-edge subblocks 0, 4, 8, and 
12 are the right-edge subblocks 3, 7, 11, and 15, respectively, 
of the (already coded) macroblock immediately to the left. 
Similarly, the subblocks above the top-edge subblocks 0, 1, 2, 
and 3 are the bottom-edge subblocks 12, 13, 14, and 15 of the 
already-coded macroblock immediately above us. 


For macroblocks on the top row or left edge of the image, some of 
the predictors will be non-existent. Such predictors are taken 
to have had the value B DC PRED, which, perhaps conveniently, 
takes the value 0 in the enumeration above. A simple management 
scheme for these contexts might maintain a row of above 
predictors and four left predictors. Before decoding the frame, 
the entire row is initialized to B DC PRED; before decoding each 
row of macroblocks, the four left predictors are also set to 

B DC PRED. After decoding a macroblock, the bottom four subblock 
modes are copied into the row predictor (at the current position, 
which then advances to be above the next macroblock), and the 
right four subblock modes are copied into the left predictor. 


Many macroblocks will of course be coded using a 16x16 luma 
prediction mode. For the purpose of predicting ensuing subblock 
modes (only), such macroblocks derive a subblock mode, constant 
throughout the macroblock, from the 16x16 luma mode as follows: 
DC PRED uses B DC PRED, V PRED uses B VE PRED, H PRED uses 

B HE PRED, and TM PRED uses B TM PRED. 


Although we discuss interframe modes in Section 16, we remark 
here that, while interframes do use all the intra-coding modes 
described here and below, the subblock modes in an interframe are 
coded using a single constant probability array that does not 
depend on any context. 


The dependence of subblock mode probability on the nearby subblock 
mode context is most easily handled using a three-dimensional 
constant array: 
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===="Begin code. block e2----—--2--2u———5-esu-o0s-e 222 


const Prob kf bmode prob [num intra bmodes] [num intra bmodes] 
[num intra bmodes-1]; 


-—-——pPiEndgvecode DOCK == ES Se qup ee 


The outer two dimensions of this array are indexed by the already- 
coded subblock modes above and to the left of the current block, 
respectively. The inner dimension is a typical tree probability list 
whose indices correspond to the even indices of the bmode tree above. 
The mode for the j^(th) luma subblock is then 


vers Begin code block ======2===5===5 === 25235 


Bmode = (intra_bmode) treed_read(d, bmode_tree, kf_bmode_prob 
[A] [L]); 


So ene code. DVO CK 53235522259 HR a ee 

Where the 4x4 Y subblock index j varies from 0 to 15 in raster order, 
and A and L are the modes used above and to the left of the j% (th) 
subblock. 


The contents of the kf bmode prob array are given at the end of this 
section. 


11.4. Chroma Modes 
After the Y mode (and optional subblock mode) specification comes the 
chroma mode. The chroma modes are a subset of the Y modes and are 
coded using the uv mode tree, as described in Section 8 and repeated 


here for convenience: 


====""Begán ¡code block .=======3====2=2392+===2======HPS 22255 


const tree index uv mode tree [2 * (num uv modes - 1)] = 
{ 
-DC PRED, 2, /* root: DC PRED < "0", "1" subtree */ 
-V PRED, 4, /* "1" subtree: V PRED = "10", 
"11" subtree */ 
-H PRED, -TM PRED /* "11" subtree: H PRED < "110", 
TM PRED < "111" */ 


y; 


===> send. Code: block, čeri ooo HS ESSE TR SRA EE SE 
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TE. 


As for the Y modes (in a key frame), the chroma modes are coded using 
a fixed, contextless probability table: 


4 == Begin code block: ="=235=2235+222 2222529222522 22585 


const Prob kf uv mode prob [num uv modes - 1] = ( 142, 114, 183); 
uv mode - (intra mbmode) treed read(d, uv mode tree, 
kf uv mode prob); 


+25 Bnd. code block, ze SES HH reum esos 


This completes the description of macroblock prediction coding for 
key frames. As will be discussed in Section 16, the coding of intra 
modes within interframes is similar, but not identical, to that 
described here (and in the reference code) for prediction modes and, 
indeed, for all tree-coded data in VP8. 


5. Subblock Mode Probability Table 


Finally, here is the fixed probability table used to decode subblock 
modes in key frames. 


===> Begin code block -s-e-Et Eo 555 5m 5 IP: 6 Se == 


const Prob kf bmode prob [num intra bmodes] [num intra bmodes] 
[num intra bmodes-1] - 
{ 
{ 
231, 120, .48, 89; 115, IT3, 120, 152, 112], 
152, 179, 64, 126, 170, 118, 46, 70, 95), 
175, 69, 143, 80, 85, 82, 72, 155, 103), 
56, “58, 10; Lily 218, 189, 17; 13, 152]; 
144, 71, 10, 38, 171, 213, 144, 34, 26}, 
114, .:26, -L77 163; 44, 195, 21; 10, 173] 
121, 24, 80, 195, 26, 62, 44, 64, 85), 
170, 46, 55, 19, 136, 160, 33, 206, 71], 
63, 20, 8, 114, 114, 208, 12, 9, 226}, 
81, 40» :11, 96, 182, 84, 29,  l6,- 36) 


—— mama cA DA cO cS c oc oc 


), 
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134, 
72, 
66, 
41, 

104, 
74, 
65, 
87, 
47, 
66, 


aaa cA ama c cS cs 


88, 
43, 
39, 
56, 
107, 
39, 
34, 
68, 
34, 
62, 


—— A c a cA mo cS do cS cs 


593; 
60, 
112, 
40, 
100, 
88, 
61, 
142, 
41, 
sl, 


SAA do ao omo cS mo cS cs 


125, 
95, 
15, 
ST; 

115; 
38, 
41, 

LOL; 
57, 

117, 


AAA cA OA cS mo cS cs 


), 
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147, 
183, 
200, 


116, 
106, 


137, 
130, 

99, 
178, 

27, 
146, 
160, 

44, 
110, 
102, 


150, 
117, 
87, 
104, 
26, 
IE. 
206, 
22, 
102, 
95, 


159, 
172, 

85, 
196, 

43, 
140, 
155, 

16, 
102, 
168, 


88, 
89, 
47, 
ZA, 
10, 
121, 
138, 
10, 
102, 
36, 


98, 101, 106, 
Lorp 11%," 32, 


241, 141, 26, 


217, 2997 :87, 
73, 166, 49, 


182, 183, 21, 
197, 189, 23, 
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148), 
80], 
128], 
107), 
7}, 
157), 
128), 
23], 
194], 
22) 


205], 
61), 
171), 
TTk, 
31), 
64), 
73}, 
114), 
124), 
245 


TELE; 
111], 
114], 
109), 
71], 
154], 
209], 
171], 
221], 
82) 


82), 
45), 
1), 
49], 
6}, 
85}, 
114}, 
26}, 
43}, 
26} 


November 2011 


[Page 48] 


RFC 6386 
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), 
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138, 31, 
67, 87, 
635-759; 
40, 40, 
57, 46, 
47, 15, 
46, 17, 
65, 32, 
40, 3, 
87, 37, 

104, 55, 
64, 90, 
54, 57, 
30, 34y 
doy: 23.2 
395 19, 
3T, 9, 
88, 31, 
56, 21, 
Boj». 387 

102, 61, 
69, 60, 
68, 45, 
62, 17, 
75, i157 
37, 43, 
63, 97 
86, 6, 
56, 8, 
58, 157 

164, 50, 
51, 103, 
86, 40, 
22, 26, 
83, 12, 
45, 16; 
bo, 21, 
85, 26, 
18, 11, 
So, 27; 
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171, 
169, 
180, 
116, 
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183, 
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115, 
115, 
115, 
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27, 166, 38, 
82, 115, 26, 
59, 166, 93, 
143, 209, 34, 
128, lj BA} 
34, 223, 49, 
6, 98, l5; 
28, 128, 23, 
51, 192; 18, 
595. Ts 64, 
9, 54, 53, 
40, Al, 23, 
5, 41, 38, 
152, 116, 10, 
192, 255, 160; 
26, 114, 32, 
2, T5; T3 
L02;,. 857. 05; 
595.2055. 457 
73, 102, T, 
34, 53, 31, 
73, 119, 28, 
Tp- 47; nee 
146, 85, 55, 
64, 255, 184, 
100, 163, 85, 
28, 64, 32, 
64, 255, 25, 
1375-^255;,- 55; 
135, 57, 26; 
X54, 133;: 25; 
131, 123, 31, 
148, 224, 45, 
240, 154, 14, 
19272:2357 68, 
64, 222, T 
60, 138, 23, 
128, 128, 32, 
144, 171, 4, 
174; 171; 12, 
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226}, 
57}, 
213), 
134), 
51], 
255), 
73], 
85], 
192), 
98) 


192]; 
SU er 
171), 
70}, 
16}, 

1}, 
85}, 

1}, 
128}, 
40} 


DTE, 
158), 
128]; 
209), 
28], 
197}, 
213}, 
ITET; 
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190, 80, 35, 99, 180, 80, 126, 54, 45}, 
85, 126, 47, 87, 176, Bi, 41, 20, 32}, 
101, 75, 128, 139, 118, 146, 116, 128, 85}, 
56, 41, 15, 176, 236, 85, 37, 9, 62}, 
146, 36, 19, 30, 171, 255, 97, 27, 20}, 
71, 30, 17, 119, 118, 255, 17, 18, 138}, 
101, 38, 60, 138, 55, 70, 43, 26, 142}, 
138, 45, 6l, 62, 219, 1, 81, 188, 64}, 
32, 4l, 20, 117, 151, 142, 20, 21, 163}, 
112, 19, 12, 6l, 195, 128, 48, 4, 24} 


aaa OA ss c c c cs 


) 
y; 


=== End, code Dlock-======s=5=25=89=2=5=9923SSp9=2=59=3ep9+ 
Intraframe Prediction 


Intraframe prediction uses already-coded macroblocks within the 
current frame to approximate the contents of the current macroblock. 
It applies to intra-coded macroblocks in an interframe and to all 
macroblocks in a key frame. 


Relative to the current macroblock "M", the already-coded macroblocks 
include all macroblocks above M together with the macroblocks on the 
same row as, and to the left of, M, though at most four of these 
macroblocks are actually used: the block "A" directly above M, the 
blocks immediately to the left and right of A, and the block 
immediately to the left of M. 


Each of the prediction modes (i.e., means of extrapolation from 
already-calculated values) uses fairly simple arithmetic on pixel 
values whose positions, relative to the current position, are defined 
by the mode. 


The chroma (U and V) and luma (Y) predictions are independent of each 
other. 


The relative addressing of pixels applied to macroblocks on the upper 
row or left column of the frame will sometimes cause pixels outside 
the visible frame to be referenced. Usually such out-of-bounds 
pixels have an assumed value of 129 for pixels to the left of the 
leftmost column of the visible frame and 127 for pixels above the top 
row of the visible frame (including the special case of the pixel 
above and to the left of the top-left pixel in the visible frame). 
Exceptions to this (associated to certain modes) will be noted below. 
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12. 


12 


The already-coded macroblocks referenced by intra-prediction have 
been "reconstructed", that is, have been predicted and residue- 
adjusted (as described in Section 14), but have not been loop- 
filtered. While it does process the edges between individual 
macroblocks and individual subblocks, loop filtering (described in 
Section 15) is applied to the frame as a whole, after all of the 
macroblocks have been reconstructed. 


1. mb skip coeff 
The single bool flag is decoded using prob skip false if and only if 


mb no skip coeff is set to 1 (see Sections 9.10 and 9.11). If 
mb no skip coeff is set to 0, then this value defaults to 0. 


.2. Chroma Prediction 


The chroma prediction is a little simpler than the luma prediction, 
So we treat it first. Each of the chroma modes treats U and V 
identically; that is, the U and V prediction values are calculated in 
parallel, using the same relative addressing and arithmetic in each 
of the two planes. 


The modes extrapolate prediction values using the 8-pixel row "A" 
lying immediately above the block (that is, the bottom chroma row of 
the macroblock immediately above the current macroblock) and the 
8-pixel column "L" immediately to the left of the block (that is, the 
rightmost chroma column of the macroblock immediately to the left of 
the current macroblock). 


Vertical prediction (chroma mode V PRED) simply fills each 8-pixel 
row of the 8x8 chroma block with a copy of the "above" row (A). If 
the current macroblock lies on the top row of the frame, all 8 of the 
pixel values in A are assigned the value 127. 


Similarly, horizontal prediction (H PRED) fills each 8-pixel column 
of the 8x8 chroma block with a copy of the "left" column (L). If the 
current macroblock is in the left column of the frame, all 8 pixel 
values in L are assigned the value 129. 


DC prediction (DC PRED) fills the 8x8 chroma block with a single 
value. In the generic case of a macroblock lying below the top row 
and right of the leftmost column of the frame, this value is the 
average of the 16 (genuinely visible) pixels in the (union of the) 
above row A and left column L. 


Otherwise, if the current macroblock lies on the top row of the 
frame, the average of the 8 pixels in L is used; if it lies in the 
left column of the frame, the average of the 8 pixels in A is used. 
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Note that the averages used in these exceptional cases are not the 
same as those that would be arrived at by using the out-of-bounds A 
and L values defined for V PRED and H PRED. In the case of the 
leftmost macroblock on the top row of the frame, the 8x8 block is 
simply filled with the constant value 128. 


For DC PRED, apart from the exceptional case of the top-left 
macroblock, we are averaging either 16 or 8 pixel values to get a 
single prediction value that fills the 8x8 block. The rounding is 
done as follows: 


set Begin code block: ste oe - zoo or 29222292322 


int sum; /* sum of 8 or 16 pixels at (at least) 16-bit precision %/ 
int shf; /* base 2 logarithm of the number of pixels (3 or 4) */ 


Pixel DCvalue = (sum + (1 << (shf-1))) >> shf; 
==> uno code DIO Chi Sana Nm RE > z EP AS ae 


Because the summands are all valid pixels, no "clamp" is necessary in 
the calculation of DCvalue. 


The remaining "True Motion" (TM PRED) chroma mode gets its name from 
an older technigue of video compression used by On2 Technologies, to 


which it bears some relation. In addition to the row "A" and column 
"L", TM PRED uses the pixel "P" above and to the left of the chroma 
block. 
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The following figure gives an example of how TM PRED works: 


-4scwBegin-code block. -S sense assess heen e 
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+5 End code block; šarm ESTER nm nur SS ese 


and Ls represent reconstructed pixel values from 


previously coded blocks, 


As, 


Where P, 


and X00 through X77 represent predicted 


g eguation to 


TM PRED uses the followin 


values for the current block. 


calculate X i 


es 
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The exact algorithm is as follows: 
<< o Begin code block. =============22 2023222252232 


void TMpred( 


Pixel b[8][8], /* chroma (U or V) prediction block */ 

const Pixel A[8], /* row of already-constructed pixels 
above block */ 

const Pixel L[8], /* column of "" just to the left of 
block */ 

const Pixel P /* pixel just to the left of A and 


above L*/ 


int r = 0; /* row */ 
do { 
int c < 0; /* column */ 
do { 
b[r][c] = clamp255(L[r]t A[c] - P); 


} while (++c « 8); 
} while (++r < 8); 
} 


-ec—UEnd code block —--5-E ETER Tm ana See sass Seana 


Note that the process could equivalently be described as propagating 
the vertical differences between pixels in L (starting from P), using 
the pixels from A to start each column. 


An implementation of chroma intra-prediction may be found in the 
reference decoder file predict.c (Section 20.14). 


Unlike DC_PRED, for macroblocks on the top row or left edge, TM_PRED 
does use the out-of-bounds values of 127 and 129 (respectively) 
defined for V PRED and H PRED. 


12.3.  Luma Prediction 


The prediction processes for the first four 16x16 luma modes 

(DC PRED, V PRED, H PRED, and TM PRED) are essentially identical to 
the corresponding chroma prediction processes described above, the 
only difference being that we are predicting a single 16x16 luma 
block instead of two 8x8 chroma blocks. 


Thus, the row "A" and column "L" here contain 16 pixels, the DC 
prediction is calculated using 16 or 32 pixels (and shf is 4 or 5), 
and we of course fill the entire prediction buffer, that is, 16 rows 
(or columns) containing 16 pixels each. The reference implementation 
of 16x16 luma prediction is also in predict.c. 
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In the remaining luma mode (B PRED), each 4x4 Y subblock is 
independently predicted using one of ten modes (listed, along with 
their encodings, in Section 11). 


Also, unlike the full-macroblock modes already described, some of the 
subblock modes use prediction pixels above and to the right of the 
current subblock. In detail, each 4x4 subblock "B" is predicted 
using (at most) the 4-pixel column "L" immediately to the left of B 
and the 8-pixel row "A" immediately above B, consisting of the 4 
pixels above B followed by the 4 adjacent pixels above and to the 
right of B, together with the single pixel "P" immediately to the 
left of A (and immediately above L). 


For the purpose of subblock intra-prediction, the pixels immediately 
to the left and right of a pixel in a subblock are the same as the 
pixels immediately to the left and right of the corresponding pixel 
in the frame buffer "F". Vertical offsets behave similarly: The 
above row A lies immediately above B in F, and the adjacent pixels in 
the left column L are separated by a single row in F. 


Because entire macroblocks (as opposed to their constituent 
subblocks) are reconstructed in raster-scan order, for subblocks 
lying along the right edge (and not along the top row) of the current 
macroblock, the four "extra" prediction pixels in A above and to the 
right of B have not yet actually been constructed. 


Subblocks 7, 11, and 15 are affected. All three of these subblocks 
use the same extra pixels as does subblock 3 (at the upper right 
corner of the macroblock), namely the 4 pixels immediately above and 
to the right of subblock 3. Writing (R,C) for a frame buffer 
position offset from the upper left corner of the current macroblock 
by R rows and C columns, the extra pixels for all the right-edge 
subblocks (3, 7, 11, and 15) are at positions (-1,16), (-1,17), 


(-1,18), and (-1,19). For the rightmost macroblock in each 
macroblock row except the top row, the extra pixels shall use the 
same value as the pixel at position (-1,15), which is the rightmost 


visible pixel on the line immediately above the macroblock row. For 
the top macroblock row, all the extra pixels assume a value of 127. 


The details of the prediction modes are most easily described in 
code. 
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===> Begin code block sze osmo o 5-5 5 +22 9222 
/* Result pixels are often averages of two or three predictor 
pixels. The following subroutines are used to calculate 
these averages. Because the arguments are valid pixels, no 

clamping is necessary. An actual implementation would 
probably use inline functions or macros. */ 


/* Compute weighted average centered at y w/adjacent x, z */ 


Pixel avg3(Pixel x, Pixel y, Pixel z) ( 
return (x + y + y + z + 2) >> 2;) 


/* Weighted average of 3 adjacent pixels centered at p */ 
Pixel avg3p (const Pixel *p) ( return avg3(p[-1], pl0], pll]);) 
/* Simple average of x and y */ 

Pixel avg2(Pixel x, Pixel y) ( return (x + y + 1) >> 1;) 


/* Average of p[0] and p[1] may be considered to be a synthetic 
pixel lying between the two, that is, one half-step past p. */ 


Pixel avg2p(const Pixel *p) ( return avg2(p[0], p[1]);} 


void subblock intra predict( 


Pixel B[4][4], /* Y subblock prediction buffer */ 
const Pixel "A, /* A[0]...A[7] < above row, A[-1] < P */ 
const Pixel *L, /* L[0]...L[3] » left column, L[-1] < P %/ 
intra bmode mode /* enum is in Section 11.2 */ 
) 4 
Pixel E[9]; /* 9 already-constructed edge pixels */ 
E[0] = L[3]; Ell] = L[2]; EI2] = LI1]; EIS3] = L[0]; 
E[4] = A[-11]; /* == L[-1] == P */ 
E[S] = A[0]; E[6] = A[1]; EI[7] = A[2]; E[8] = A[3]; 
switch(mode) { 
/* First four modes are similar to corresponding 
full-block modes. */ 
case B_DC_PRED: 
{ 
int v = 4; /* DC sum/avg, 4 is rounding adjustment */ 
int i= 0; do (v += A[i] + L[il;} while (++i < 4); 
v >>= 3; /* averaging 8 pixels */ 
i = 0; do ( /* fill prediction buffer with constant DC 
value */ 
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int j = 0; do { B[i]lj] = v;) while (trj < 4); 
while (tti < 4); 
break; 


} 


case B TM PRED: /* just like 16x16 TM PRED */ 
{ 
int r = 0; do { 
int c = 0; do { 
B[r][c] = clamp255(L[r] + A[c] - A[-1]); 
} while (++c < 4); 
} while (++r < 4); 
break; 


} 


case B_VE_PRED: /* like 16x16 V_PRED except using averages */ 
{ 


int c = 0; do { /* all 4 rows = smoothed top row */ 

B[0] [c] = B[1][c] = B[2][c] = B[3] [c] = avg3p(A + c); 
} while (ttc < 4); 
break; 


} 


case B HE PRED: /* like 16x16 H_PRED except using averages */ 
{ 
/* Bottom row is exceptional because L[4] does not exist */ 
int r = 3; while (1) { /* all 4 columns = smoothed left 
column */ 


B[r][0] = B[r][1] = BIr][2] = Bir] [3] < v; 
if (--r < 0) 
break; 
v = avg3p(L * r); /* upper 3 rows use average of 


3 pixels */ 


/* The remaining six "diagonal" modes subdivide the 
prediction buffer into diagonal lines. All the pixels 
on each line are assigned the same value; this value is 
(a smoothed or synthetic version of) an 
already-constructed predictor value lying on the same 
line. For clarity, in the comments, we express the 
positions of these predictor pixels relative to the 
upper left corner of the destination array B. 
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These modes are unique to subblock prediction and have 


no full-block analogs. 
+|- 45 degrees from hori 


The first two use lines at 
zontal (or, equivalently, 


vertical), that is, lines whose slopes are +|- T 
case B LD PRED: /* southwest (left and down) step - 
(-1, 1) or (1,-1) */ 

/* avg3p(A + j) is the "smoothed" pixel at (-l,j) %/ 
B[0][0] < avg3p(A t 1); 
B[0][1] = B[1][0] = avg3p(A + 2); 
B[0] [2] = B[1][1] = B[2][0] = avg3p(A + 3); 
B[0][3] = B[1] [2] = B[2][1] = B[3][0] = avg3p(A + 4); 
B[1] [3] = B[2] [2] = B[3] [1] = avg3p(A + 5); 
B[2][3] = B[3][2] = avg3p(A + 6); 
B[3] [3] = avg3(A[6], A[7], A[7]); /* A[8] does not exist */ 
break; 

case B_RD_PRED: /* southeast (right and down) step = 

(Xl) or- (ELL) *7 
B[3] [0] = avg3p(E + 1); /* predictor is from (2, -1) */ 
B[3][1] = B[2][0] = avg3p(E + 2); /* (1, -1) */ 
B[3] [2] = B[2][1] = B[1] [0] = avg3p(E + 3); /* (0, -1) */ 
B[3][3] = B[2][2] = B[1][1] = B[0][0] = 
avg3p(E + 4); /* (-1, -1) */ 

B[21[3] = B[1][2] = B[0][1] = avg3p(E + 5); /* (-1, 0) */ 
B[1][3] < B[0][2] < avg3p(E t 6); /* (-1, 1) */ 
B[0][3] < avg3p(E * 7); /* (-1, 2) */ 
break; 


/* The remaining 4 diagonal modes use lines whose slopes are 


+|- 2 and +|- 1/2. 
+|- 27 degrees from hori 


Unlike the 45 degree dia 


zontal or vertical. 


gonals, 


here we often need to 


The angles of these lines are roughly 


"synthesize" predictor pixels midway between two actual 


predictors using avg2p(p), 


the pixel "at" p[1/2]. */ 
case B_VR_PRED: /* SSE (vertical right) step = 
(2,1) or (-2,-1) */ 
B[3] [0] = avg3p(E + 2); /* predictor is from (1, 
B[2][0] < avg3p(E * 3); /* (0, -1) */ 
B[3] [1] = B[1][0] = avg3p(E + 4); /* (-1, -1) 
B[2][1] < B[0][0] avg2p(E + 4); /* (-1, -1/2) 
B[3] [2] = B[1][1] = avg3p(E + 5); /* (-l, 0) 
B[21[2] = B[0][1] = avg2p(E + 5); /* (-1, 1/2) 
B[3] [3] = B[1][2] = avg3p(E + 6); /* (-1, 1) 
B[21[3] = B[0][2] = avg2p(E + 6); /* (-1, 3/2) 
Bankoski, et al. Informational 
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B[1][3] < avg3p(E * 7); /* (-1, 2) */ 
B[0][3] < avg2p(E t 7); /* (-1, 5/2) */ 
break; 

case B VL PRED: /* SSW (vertical left) step - 


(2,-1) or (-2,1) */ 
B[0][0] < avg2p(A); /* predictor is from (-1, 1/2) */ 
B[1][0] < avg3p(A * 1); /* (-1, 1) */ 


B[2] [0] = B[0] [1] avg2p(A + 1); /* (-1, 3/2) */ 
B[1] [1] = B[3] [0] avg3p (A + 2); /* (-1, 2) A 
B[2] [1] = BLO] [2] = avg2p(A + 2); /* 4-1; 5/2) */ 
B[3] [1] = B[1] [2] = avg3p(A + 3); /* (-l, 3) */ 
B[2] [2] = BLO] [3] = avg2p(A + 3); /* (-1, 7/2) */ 
B[3] [2] = B[1] [3] = avg3p(A + 4); /* (51, 4) *7 


/* Last two values do not strictly follow the pattern. */ 

B[21[3] < avg3p(A * 5); /* (-1, 5) [avg2p(A t 4) < 
(-1,9/2)] */ 

B[3][3] < avg3p(A * 6); /* (-1, 6) [avg3p(A t 5) - 
(-1,5)] */ 

break; 


case B HD PRED: /* ESE (horizontal down) step - 
(1,2) or (-1,-2) */ 
B[3] [0] = avg2p(E); /* predictor is from (5/2, -1) */ 
B[3][1] = avg3p(E + 1); /* (2, -1) */ 


B[2] [0] = B[3] [2] svg2p(E + 1); /* ( 3/2, -1) */ 
B[2][1] < B[3][3] < avg3p(E t 2); /* ( V; sr) 8/7 
B[2] [2] = B[1][0] = avg2p(E + 2); /* ( 1/2, -1) */ 
B[2] [3] = B[1] [1] = avg3p(E + 3); /* ( Qi SLY EL 
B[1][2] = B[0][0] = avg2p(E + 3); /* (-1/2, -1) */ 
B[1] [3] = B[0] [1] = avg3p(E + 4); /* ( -1, -1) */ 
B[0][2] < avg3p(E t 5); /* (-1, 0) */ 
B[0][3] < avg3p(E t 6); /* (-1, 1) */ 
break; 

case B HU PRED: /* ENE (horizontal up) step = (1,-2) 


or (-1,2) */ 
B[0][0] » avg2p(L); /* predictor is from (1/2, -1) */ 
B[0][1] < avg3p(L * 1); /* (1, -1) */ 
B[0][2] = B[1][0] = avg2p(L + 1 


) ) 

) ) 
B[1]I2] < B[2][0] < avg2p(L t 2); /* (5/2, -1) */ 

L * 


B[0] [3] = B[1] [1] = avg3p(L + 2); /* 
B[1] [3] = B[2] [1] = avg3(LI2], LI3], L[3]); /* (3, -1) */ 
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/* Not possible to follow pattern for much of the bottom 
row because no (nearby) already-constructed pixels lie 
on the diagonals in question. */ 

B[2] [2] = BI21[3] = B[3][0] = B[3][1] = B[3][2] = B[3][3] 

= L[3]; 


} 


=25% Eno. Code block === 5=3===9 33 === SS RE Ss SSeS ERASE RSS 


The reference decoder implementation of subblock intra-prediction may 
be found in predict.c (Section 20.14). 


DCT Coefficient Decoding 


The second data partition consists of an encoding of the quantized 
DCT (and WHT) coefficients of the residue signal. As discussed in 
the format overview (Section 2), for each macroblock, the residue is 
added to the (intra- or inter-generated) prediction buffer to produce 
the final (except for loop filtering) reconstructed macroblock. 


VP8 works exclusively with 4x4 DCTs and WHTs, applied to the 24 (or 
25 with the Y2 subblock) 4x4 subblocks of a macroblock. The ordering 
of macroblocks within any of the "residue" partitions in general 
follows the same raster scan as used in the first "prediction" 
partition. 


For all intra- and inter-prediction modes apart from B PRED (intra: 
whose Y subblocks are independently predicted) and SPLITMV (inter), 
each macroblock's residue record begins with the Y2 component of the 
residue, coded using a WHT. B PRED and SPLITMV coded macroblocks 
omit this WHT and specify the Oth DCT coefficient in each of the 16 Y 
subblocks. 


After the optional Y2 block, the residue record continues with 16 
DCTs for the Y subblocks, followed by 4 DCTs for the U subblocks, 
ending with 4 DCTs for the V subblocks. The subblocks occur in the 
usual order. 


The DCTs and WHT are tree-coded using a 12-element alphabet whose 
members we call "tokens". Except for the end-of-block token (which 
sets the remaining subblock coefficients to zero and is followed by 
the next block), each token (sometimes augmented with data 
immediately following the token) specifies the value of the single 
coefficient at the current (implicit) position and is followed by a 
token applying to the next (implicit) position. 
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the ordering of the coefficients 
DCTs begin at coefficient 1 if Y2 


is present, and begin at coefficient 0 if Y2 is absent. The WHT for 


a Y2 subblock always begins at coefficient 0. 


13.1. Macroblock without Non-Zero Coefficient Values 


If the flag within macroblock (MB) MODE INFO indicates that a 
macroblock does not have any non-zero coefficients, the decoding 
process of DCT coefficients is skipped for the macroblock. 


13.2. Coding of Individual Coefficient Values 


The coding of coefficient tokens is the same for the DCT and WHT, and 
for the remainder of this section "DCT" 


either DCT or WHT. 


All tokens (except end-of-block) 
value or a range of unsigned values 


should be taken to mean 


specify either a single unsigned 
(immediately) 


followed by a 


simple probabilistic encoding of the offset of the value from the 


base of that range. 


Non-zero values (of either type) 
indicating the sign of the coded value 


if 0). 


Below are the tokens and decoding tree. 


---- Begin code block 


typedef enum 
{ 


DCT 0, /* 
DET 1; /* 
DCT 2, /* 
DCT 3; ZE 
DCT 4, {x 


det. catl, /* 
dct, cat2, /* 
dct cat3, /* 
dct cat4, /* 
dct, cat5, /* 
dct cat6, /* 
dct eob, /* 


num dct tokens 


} 
dct token; 
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value 0 

1 */ 

2 */ 

3 %/ 

4 */ 
range 5 - 
7 - 10 

11 - 18 
19 - 34 
35 - 66 


*/ 


67 - 2048 (1982) */ 
end of block %/ 


fr 12 


Af 


Informational 


are then followed by a flag 
(negative if 1, positive 
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const tree index coeff tree [2 * (num dct tokens - 1)] < 
{ 
-dct eob, 2, /* eob < "0" x] 
-DCT. 0, 4, /* 0 = "10" x / 
-DCT 1, 6, /* 1 = "110" */ 
8, 12, 
-DCT 2, 10, /* 2 = "11100" */ 
-DCT 3, -DCT 4, /* 3 e ELLIOTT, 4 s "ISTLOIL"-*/ 
145 T6, 
-dct catl, -dct cat2, /% catl < "111100", 
cat2 = "111101" */ 
18, 20, 
-dct cat3, -dct cat4, /* cat3 - "1111100", 
cat4 - "1111101" */ 
-dct cat5, -dct cat6 /* cat4 = "1111110", 
cat4 z "1111111" %/ 


Nn 
So ay ENG: Code blok ea SaaS aR ISR 5532522285 


In general, all DCT coefficients are decoded using the same tree. 
However, if the preceding coefficient is a DCT 0, decoding will skip 
the first branch, since it is not possible for dct eob to follow a 
DCT 0. 


The tokens dct catl ... dct cat6 specify ranges of unsigned values, 
the value within the range being formed by adding an unsigned offset 
(whose width is 1, 2, 3, 4, 5, or 11 bits, respectively) to the base 
of the range, using the following algorithm and fixed probability 
tables. 
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13. 


===> Begin code block === ===> +++ 22522 


uint DCTextra(bool decoder *d, const Prob *p) 

{ 
uint v = 0; 
do ( v += v + read bool(d, *p);} while (*++p); 
return v; 


const Prob Pcatl[] = 1 
const Prob Pcat2[] = 41 
const Prob Pcat3[] < ( 173, 148, 140, 0); 
const Prob Pcat4[] { 176, 155, 140, 135, 0); 
const Prob Pcat5[] < ( 180, 157, 141, 134, 130, 0); 
const Prob Pcat6[] < 
4 254, 254, 243, 230, 196, 1/7,.153; 140, 133, 130, .129; 0; 


sscecEndscode. block sET5E- Eo SoS SEE SRE SSE LEŽ! Sas 

If v -- the unsigned value decoded using the coefficient tree, 
possibly augmented by the process above -- is non-zero, its sign is 
set by simply reading a flag: 

=== Begin code block <=======3==2=2=59==5=2222=25===55====== 


if (read bool(d, 128)) 
v = -v; 


scccuEndooode BDLOCk =5= === 29H assi Ss E 

3. Token Probabilities 

The probability specification for the token tree (unlike that for the 
"extra bits" described above) is rather involved. It uses three 
pieces of context to index a large probability table, the contents of 
which may be incrementally modified in the frame header. The full 
(non-constant) probability table is laid out as follows. 

=-=- Begin code. block ==========25===25=====+2S=====2=====> 

Prob coeff probs [4] [8] [3] [num dct tokens-1]; 


cp nO WES ae LO dk == ++ SO A ES 
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Working from the outside in, the outermost dimension is indexed by 
the type of plane being decoded: 


o 0 - Y beginning at coefficient 1 (i.e., Y after Y2) 
o 1- Y2 
o 2-UorV 
o 3 - Y beginning at coefficient 0 (i.e., Y in the absence of Y2). 
The next dimension is selected by the position of the coefficient 
being decoded. That position, c, steps by ones up to 15, starting 
from zero for block types 1, 2, or 3 and starting from one for block 
type 0. The second array index is then 
STRA Beginacode DLO CK == A RARR R Se ora sk TRAS RES 
coeff_bands [c] 
==" na ES de DOCK SI a ee et a 
Where: 
sHs>" Begin’ code “block: -==========4=222==20=22++ aS aaa SaesS 
const int coeff_bands [16] = { 

Oy 1, 2, 3, 6, 4, Dn 6,4 6; 6, 6, 6; 6, 6y 6; 1 
y; 
===> “End Code block ========== 5-55 o 5-5 O 5 MES BAS A == 
is a fixed mapping of position to "band". 
The third dimension is the trickiest.  Roughly speaking, it measures 
the "local complexity" or extent to which nearby coefficients are 
non-zero. 
For the first coefficient (DC, unless the block type is 0), we 
consider the (already encoded) blocks within the same plane (Y2, Y, 
U, or V) above and to the left of the current block. The context 
index is then the number (0, 1, or 2) of these blocks that had at 
least one non-zero coefficient in their residue record.  Specifically 
for Y2, because macroblocks above and to the left may or may not have 
a Y2 block, the block above is determined by the most recent 
macroblock in the same column that has a Y2 block, and the block to 


the left is determined by the most recent macroblock in the same row 
that has a Y2 block. 
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Beyond the first coefficient, the context index is determined by the 
absolute value of the most recently decoded coefficient (necessarily 
within the current block) and is 0 if the last coefficient was a 
zero, 1 if it was plus or minus one, and 2 if its absolute value 
exceeded one. 


Note that the intuitive meaning of this measure changes as 
coefficients are decoded. For example, prior to the first token, a 
zero means that the neighbors are empty, suggesting that the current 
block may also be empty. After the first token, because an end-of- 
block token must have at least one non-zero value before it, a zero 
means that we just decoded a zero and hence guarantees that a 
non-zero coefficient will appear later in this block. However, this 
shift in meaning is perfectly okay because the complete context 
depends also on the coefficient band (and since band 0 is occupied 
exclusively by position 0). 


As with other contexts used by VP8, the "neighboring block" context 
described here needs a special definition for subblocks lying along 


the top row or left edge of the frame. These "non-existent" 
predictors above and to the left of the image are simply taken to be 
empty -- that is, taken to contain no non-zero coefficients. 


The residue decoding of each macroblock then requires, in each of two 
directions (above and to the left), an aggregate coefficient 
predictor consisting of a single Y2 predictor, two predictors for 
each of U and V, and four predictors for Y. In accordance with the 
Scan-ordering of macroblocks, a decoder needs to maintain a single 
"left" aggregate predictor and a row of "above" aggregate predictors. 


Before decoding any residue, these maintained predictors may simply 
be cleared, in compliance with the definition of "non-existent" 
prediction. After each block is decoded, the two predictors 
referenced by the block are replaced with the (empty or non-empty) 
state of the block, in preparation for the later decoding of the 
blocks below and to the right of the block just decoded. 


The fourth, and final, dimension of the token probability array is of 
course indexed by (half) the position in the token tree structure, as 
are all tree probability arrays. 


The pseudocode below illustrates the decoding process. Note that 
criteria, functions, etc. delimited with ** are either dependent on 
decoder architecture or are elaborated on elsewhere in this document. 
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===> Begin code, block ===2======222=2223=2+ 92922292922 


int block[16] = ( 0 ); /* current 4x4 block coeffs */ 

int firstCoeff = 0; 

int plane; 

int ctx2; 

int ctx3 = 0; /* the 3rd context referred to in above description */ 

Prob *probTable; 

int token; 

int sign; 

int absValue; 

int extraBits; 

bool prevCoeffWasZero = false; 

bool currentBlockHasCoeffs - false; 

/* base coeff abs values per each category, elem #0 is 
DCT VAL CATEGORY1, * #1 is DCT VAL CATEGORY2, etc. */ 

int categoryBase[6] < ( 5, 7, 11, 19, 35, 67 ); 


/* Determine plane to use */ 


if ( **current block is Y2 block** ) plane < 0; 
else if ( **current block is chroma** ) plane < 2; 
else if ( **current macroblock has Y2** ) plane -» 1; 
else plane = 3; 


/* For luma blocks of a "Y2 macroblock" we skip coeff index #0 */ 
if ( plane == 1 ) 
firstCoefftt; 


/* Determine whether neighbor 4x4 blocks have coefficients. 
This is dependent on the plane we are currently decoding; 
i.e., we check only coefficients from the same plane as the 
current block. */ 


if ( **left neighbor block has coefficients (plane)** ) 
ctx3++; 

if ( **above neighbor block has coefficients (plane) ** ) 
ctx3++; 


for( i = firstCoeff; i < 16; ++i ) 


ctx2 = coeff bands[il; 
probTable = coeff_probs [plane] [ctx2] [ctx3]; 


/* skip first code (dct_eob) if previous token was DCT_O */ 
if ( prevCoeffWasZero ) 
token = treed read ( d, **coeff tree without eob**, 
probTable ); 
else 
token < treed read ( d, coeff tree, probTable ); 


Bankoski, et al. Informational [Page 66] 


RFC 6386 VP8 Data Format and Decoding Guide 
if ( token << dct eob ) 
break; 
if ( token != DCT 0 ) 
{ 
currentBlockHasCoeffs = true; 
if ( **token has extra bits(token)** ) 
{ 
extraBits = DCTextra( token ); 
absValue = 
categoryBase[**token_to_cat_index (token) **] 
extraBits; 
} 
else 
{ 
absValue = **token_to_abs_value (token) **; 
} 
sign = read_bool(d, 128); 
block[i] = sign ? -absValue absValue; 
} 
else 


{ 
absValue = 0; 


} 


/* Set contexts and stuff for next coeff */ 


if ( absValue == ) Ctx? =" 07 
else if ( absValue == 1 ) ctx3 = 1; 
else ctx3 = 2; 
prevCoeffWasZero - true; 


} 


/* Store current block status to decoder internals */ 


**block has coefficients[currentMb][currentBlock]** = 
currentBlockHasCoeffs; 


--— End code block 


November 2011 


While we have in fact completely described the coefficient decoding 


procedure, 
reference implementation, 
(Section 20.16). 


Informational 
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the reader will probably find it helpful to consult the 
which can be found in the file tokens.c 
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13.4. Token Probability Updates 


As mentioned above, the token-decoding probabilities may change from 
frame to frame. After detection of a key frame, they are of course 
set to their defaults as shown in Section 13.5; this must occur 
before decoding the remainder of the header, as both key frames and 
interframes may adjust these probabilities. 


The layout and semantics of the coefficient probability update record 
(Section I of the frame header) are straightforward. For each 
position in the coeff probs array there occurs a fixed-probability 
bool indicating whether or not the corresponding probability should 
be updated. If the bool is true, there follows a P(8) replacing that 
probability. Note that updates are cumulative; that is, a 
probability updated on one frame is in effect for all ensuing frames 
until the next key frame, or until the probability is explicitly 
updated by another frame. 


The algorithm to effect the foregoing is simple: 


if (read bool(d, coeff update probs [i] [3] [k] [t])) 
coeff probs [i] [j] [k] [t] = read literal(d, 8); 


) while (++t < num dct tokens - 1); 
} while (++k < 3); 
) while (ttj < 8); 
) while (++i « 4); 


va > End code block sa SSS 529 Se SS SSeS SSeS so eS Ses 
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The (constant) update probabilities are as follows: 


-—-- Begin code block ==============2= === => 


const Prob coeff update probs [4] [8] [3] [num dct tokens-1] - 


{ 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
ío255, 255, 295p 255,.255,.255, 255, 255, 255, 255, ZI}, 
( 299; 255,255; 255, 255, 255, 255, 255, 255, 255, 255] 


1' 1767 246p 255, 255, 255, 255, 255, 255, 255, 299, 255, 
(223, 241, 252; 255, 255, 255, 255, 255, 255, 255, 255 
10249, 293; 293; 255, 255, 255, 255, 2555, 2994299; 2255] 


1 255, 244, 252, 255, 255, 255, 255, 255, 255, 255, 255], 
{ 234, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
( 253, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


{ 255, 246, 254, 255, 255, 255, 255, 255, 255, 255, 299), 
t 239, 253, 254, 255, 255, 255, 255, 255, 255, 299, 2959]; 
(2544 255,.254;, 255, 255, 255, 255, 255, 255, 255, 255] 


( 255, 248, 254, 255, 255, 255, 255, 255, 255, i255, 255], 
( 251, 255, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


1 255, 253, 254, 255, 255, 255, 255, 255, 255, 255, 259]; 
{ 251,4 254, 254, 255,.255, 255, 255, 255, 255, 255, 299), 
1 254; 255, 254, 255, 255, 255, 255, 255, 255, 255, 2255] 


{ 255, 254, 253, 255, 254, 255, 255, 255, 255, 255, 299), 
( 250, 255, 254, 255, 254, 255, 255, 255, 255, 255, 255], 
( 254, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


{ 255,255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
1 255; 255, 255, 255, 255, 255, 255, 255, 255, 2555, 255 
{ 259; 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 
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1 217; 2590, 255, 255,:255, 255, 255, 255, 255, 2525, 295]; 
( 225, 252, 241, 253, 255, 255, 254, 255, 255, 255, 255], 
( 234, 250, 241, 250, 253, 255, 253, 254, 255, 255, 255] 


(255, 254, 255, 255, 255, 255, 255, 255, 255, 255, 255b 
( 223, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1:238, 253, 254, 254, 255, 255, 255, 255; 255, 255,-2255] 


1 255, 248, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1249, 254, 255, 255, 255, 255, 255, 255, 255,255, 255] 
( 299, 255, 255, 255, 255, .255,.255, 255, 255, 255, 255] 


[2255, 253, 299; 2597 295; 255, 2957 255-255, 255, 255]; 
( 247, 254, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
{ 255; 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


1.255, 259, 294; 255, 255,255, 255, 255, 255, 295, 2955]; 
1 252,4 2555 255, 255, 255, 255, 255, 255, 255, 255, 299), 
1.255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 295] 


1 255, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
[ 253,255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
1 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


[-255, 294, 253, 2597; 2997 255, (299; 255, -255, 25», 255]; 
( 250, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
{- 254, 255, 255, 255, 255, 255, 255, 2957y 255, 255, 255] 


1.25554 299p 2957 25597 2554, 255, 255, 2554 255,4 299; 295]; 
{ 299; 255, 255,255, 255, 255, 2555, 255, 25554 255, "29977 
{ 2997 255, 295, 255, 255, 255, 255, 255, 2997-2097299} 


( 186, 251, 250, 255, 255, 255, 255, 255, 255, 255, 255], 
( 234, 251, 244, 254, 255, 255, 255, 255, 255, 255, 255], 
( 251, 251, 243, 253, 254, 255, 254, 255, 255, 255, 255] 
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{ 299% 253, 254, .255, 255, 255, 255, 255, 25554 255, 299], 
{ 236, 253, 2954, 255,.255, 255, 255, 255, 255, 255, 255 
( 251, 253, 253, 254, 254, 255, 255, 255, 255, 255, 255] 


{ 255, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
( 254, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
{ 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


1.255, 254, 255,255, 255, 255,255, 25554, 255, 292, 295]; 
1. 254, 254; 255, 255, .255, 255, 255, 2997 255, 255, 255, 
1254, 255, 295; 2755,.255, 255, 255, 255, 255, 2955, 22595] 


1 255, 255, 255,.255, 255, 255, 255, 255, 255, 255, 255b, 
1:254; 3255, 25554,0255, .255,./255, 255, 255; -255, “2907. 299]; 
1 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


[.255, 255, 255, 255, 255, 255, 255, 255, 255, :255, 2554, 
1.2555, 25955, 295,4. 255, 259, 255, 255, 255, 255, 299, 295]; 
1-255, 2557 (295; 295; 25), 255, 255, 255, 299; 255, 255] 


1.255, 259, 255, .255, 255, 255, 255, 255, 255; 292,5 255]; 
i 2997 255, 255,255, .255, 255, 299; 2090, 255, 255, 299], 
(299; 255,.255, :255, 255, 255, 255, 255, 255, 255, 255] 


1 255, 255, 255, 255, 299, 255, 255, 255, 255, 255, 259], 
{ 299, 3255, 2554. 255, 255, 255, 255, 255; 255, 255, 2559; 
1 255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255] 


{ 248; 259, 255, 255, 255, 255, 255, 255, 255,4 255, 299), 
1 250; 254, 252, 254, 255, 255, 255, 255, 255, 255, 295}, 
( 248, 254, 249, 253, 255, 255, 255, 255, 255, 255, 255] 


255, 253, 253, 255, 255, 255, 255, 255, 255, 255, 255], 
246, 253, 253, 255, 255, 255, 255, 255, 255, 255, 255], 
252, 254, 251, 254, 254, 255, 255, 255, 255, 255, 255] 


AAA 


Bankoski, et al. Informational [Page 71] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


{ 2554. 254, 252,205, 255, 255, 255, 255, 25554, 255, 299], 
(248, 254, 253, 255y.255, 255, 255, 255, 255, 255, ZI}; 
( 253, 255, 254, 254, 255, 255, 255, 255, 255, 255, 255] 


[ 255, 251, 254, 255, 255, 255, 255, 255, 255, 299, 255], 
1 245, 251, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
[.253, 253, 254, 255, 255, 255, 255, 255, 255, 255, 255] 


1.25554 291, 253,255, 255, 255,255, 25554, 255, 292, 2955]; 
1. 292; 253, 254, 255,.255, 255, 255,255, 255, 255, 255, 
[255,5 254, 255, 755,.255, 255, 255, 255, 255, 2955, 22595] 


1 255, 252, 255; 2097 255, 255, 255, 255, 255, 255, 255b, 
[.:249, 3255, 254, 255, 255, 3255, 255, 255; 255, 255, 299]; 
1. 255, 255, 254, 255, 255,.255, 255, 255, 255, 255, 255] 


( 29597; 255, 253, 255, 255, 255, 255, 255, 255; :255, 29554, 
1-2504. 2555,.-255,..255, 255, 255, 255, 255, 255, 299; 295]; 
1-255, 2557 255, 295; 25), 255, 255, 255, 299; 255, 255] 


1.255, 259, 255, .255, 255, 255, 255, 255, 255; 292,5 255]; 
{ 254; 255, 255, :255, 255, 255, 299; 2090, 255, 299, 255, 
[ 255, 255; 2557 :255, 255, 255, 255, 255, 255, 255, 255] 


Soon Ena code block. == SR aa SROs 
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13.5. Default Token Probability Table 
The default token probabilities are as follows. 
==== Begin.code block. zze-- Too sess ssa ooo o-----t A-----— 


const Prob default coeff probs [4] [8] [3] [num dct tokens - 1] < 


( 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128], 
( 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128], 
1 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128) 


( 253, 136, 254, 255, 228, 219, 128, 128, 128, 128, 128], 
{ 189, 129, 242, 255, 2217 213, 255, 219, 128, 128, 128}; 
1 106, 126, 227, 252, 214, 209, 255, 255, 128, 128, 128} 


{ 1, 98, 248, 255, 236, 226, 255, 255, 128, 128, 128}, 
{ 181, 133, 238, 254, 221, 234, 255, 154, 128, 128, 128}, 
{ 78, 134, 202, 247, 198, 180, 255, 219, 128, 128, 128} 


{ 1, 185, 249, 255, 243, 255, 128, 128, 128, 128, 128}, 
{ 184, 150, 247, 255, 236, 224, 128, 128, 128, 128, 128}, 
{ 77, 110, 216, 255, 236, 230, 128, 128, 128, 128, 128} 


{ 1, 101, 251, 255, 241, 255, 128, 128, 128, 128, 128}, 
{ 170, 139, 241, 252, 236, 209, 255, 255, 128, 128, 128), 
( 37, 116, 196, 243, 228, 255, 255, 255, 128, 128, 128] 


{ 1, 204, 254, 255, 245, 255, 128, 128, 128, 128, 128}, 
{ 207, 160, 250, 255, 238, 128, 128, 128, 128, 128, 128}, 
{ 102, 103, 231, 255, 211, 171, 128, 128, 128, 128, 128} 


{ 1, 152, 252, 255, 240, 255, 128, 128, 128, 128, 128}, 
{ 177, 135, 243, 255, 234, 225, 128, 128, 128, 128, 128}, 
{ 80, 129, 211, 255, 194, 224, 128, 128, 128, 128, 128} 
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{ 1, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
1 246, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
1 255, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128) 


1 198, 35, 237, 223, 193, 187, 162, 160, 145, 155, 62}, 
1 131, 45, 198, 221, 172, 176, 220, 157, 252, 221, li; 
1 68, 47, 146, 208, 149, 167, 221, 162, 255, 223, 128) 


1 1, 149, 241, 255, 221, 224, 255, 255, 128, 128, 128), 
1 184, 141, 234, 253, 222, 220, 255, 199, 128, 128, 128), 
{ 81, 99, 181, 242, 176, 190, 249, 202, 255, 255, 128} 


1 1, 129, 232, 253, 214, 197, 242, 196, 255, 255, 128), 
( 99, 121, 210, 250, 201, 198, 255, 202, 128, 128, 128], 
{ 23, 91, 163, 242, 170, 187, 247, 210, 255, 255, 128) 


{ 1, 200, 246, 255, 234, 255, 128, 128, 128, 128, 128}, 
{ 109, 178, 241, 255, 231, 245, 255, 255, 128, 128, 128}, 
{ 44, 130, 201, 253, 205, 192, 255, 255, 128, 128, 128} 


{ 1, 132, 239, 251, 219, 209, 255, 165, 128, 128, 128}, 
{ 94, 136, 225, 251, 218, 190, 255, 255, 128, 128, 128}, 
{ 22, 100, 174, 245, 186, 161, 255, 199, 128, 128, 128} 


{ 1, 182, 249, 255, 232, 235, 128, 128, 128, 128, 128}, 
{ 124, 143, 241, 255, 227, 234, 128, 128, 128, 128, 128}, 
{ 35, 77, 181, 251, 193, 211, 255, 205, 128, 128, 128} 


{ 1, 157, 247, 255, 236, 231, 255, 255, 128, 128, 128}, 
{ 121, 141, 235, 255, 225, 227, 255, 255, 128, 128, 128}, 
{ 45, 99, 188, 251, 195, 217, 255, 224, 128, 128, 128} 


{ 1, 1, 251, 255, 213, 255, 128, 129, 128; 128, 128], 
( 203, 1, 248, 255, 255, 128, 128, 128, 128, 128, 128], 
( 137, 1, 177, 255, 224, 255, 128, 128, 128, 128, 128] 
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( 253, 9, 248, 251, 207, 208, 255, 192, 128, 128, 128], 
1 175, 13, 224, 243, 193, 185, 249, 198, 255, 255, 128}, 
{ 73; 17, 171, 221, 1061, 179, 230, 107, 255, 232, 128) 


{ 1, 95, 247, 253, 212, 183, 255, 255, 128, 128, 128}, 
{ 239, 90, 244, 250, 211, 209, 255, 255, 128, 128, 128}, 
{ 155, 77, 195, 248, 188, 195, 255, 255, 128, 128, 128} 


{ 1, 24, 239, 251, 218, 219, 255, 205, 128, 128, 128}, 
{ 201, 51, 219, 255, 196, 186, 128, 128, 128, 128, 128}, 
{ 69, 46, 190, 239, 201, 218, 255, 228, 128, 128, 128} 


{ 1, 191, 251, 255, 255, 128, 128, 128, 128, 128, 128}, 
{ 223, 165, 249, 255, 213, 255, 128, 128, 128, 128, 128}, 
{ 141, 124, 248, 255, 255, 128, 128, 128, 128, 128, 128} 


{ 1, 16, 248, 255, 255, 128, 128, 128, 128, 128, 128}, 
{ 190, 36, 230, 255, 236, 255, 128, 128, 128, 128, 128}, 
{ 149, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128} 


{ 1, 226, 255, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 247, 192, 255, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 240, 128, 255, 128, 128, 128, 128, 128, 128, 128, 128} 


{ 1, 134, 252, 255, 255, 128, 128, 128, 128, 128, 128}, 
{ 213, 62, 250, 255, 255, 128, 128, 128, 128, 128, 128}, 
{ 55, 93, 255, 128, 128, 128, 128, 128, 128, 128, 128} 


{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128} 


{ 202, 24, 213, 235,.186, 191, 220, 160; 240, 175, 255}, 
1 126, 38, 182, 232, 169, 184, 228, 174, 255, 187, 128}, 
t 61, 46, 138, 219, 151, 178, 240, 170, 255, 216, 128) 
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{ 1, 1127 230, 250, 199; 194; 247, 159, 255, 255, 128); 
1 166, 109, 228, 252, 211, 215, 255, 174, 128, 128, 128), 
1 39, 777; 162, 232, 172, 180, 245, 178, 255, 255, 128) 


{ 1, 52, 220, 246, 198, 199, 249, 220, 255, 255, 128}, 
{ 124, 74, 191, 243, 183, 193, 250, 221, 255, 299,: 128), 
( 24, 71, 130, 219, 154, 170, 243, 182, 255, 255, 128] 


{ 1, 182, 225, 249, 219, 240, 255, 224, 128, 128, 128}, 
{ 149, 150, 226, 252, 216, 205, 255, 171, 128, 128, 128}, 
{ 28, 108, 170, 242, 183, 194, 254, 223, 255, 255, 128} 


{ 1, 81, 230, 252, 204, 203, 255, 192, 128, 128, 128}, 
( 123, 102, 209, 247, 188, 196, 255, 233, 128, 128, 128], 
( 20, 95, 153, 243, 164, 173, 255, 203, 128, 128, 128) 


1 1, 222, 248, 255, 216, 213, 128, 128, 128, 128, 128), 
1. $268, 175, 246, 252, 239; 205, 255, 255, 128, 128, 1283), 
{ 47, 116, 215, 255, 211, 212, 255, 255, 128, 428, 128) 


{ 1, 1215, 230, 259, 212), 214, -255, 255, 128, 128, 128); 
( 141, 84, 213, 252, 201, 202, 255, 219, 128, 128, 128}, 
{ 42, 80, 160, 240, 162, 185, 255, 205, 128, 128, 128) 


{ 1, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
( 244, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
1 238, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128) 


14. DCT and WHT Inversion and Macroblock Reconstruction 

14.1.  Deguantization 
After decoding the DCTs/WHTs as described above, each (quantized) 
coefficient in each subblock is multiplied by one of six 


deguantization factors, the choice of factor depending on the plane 
(Y2, Y, or chroma) and position (DC = coefficient zero, AC = any 
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other coefficient). If the current macroblock has overridden the 
quantizer level (as described in Section 10), then the six factors 
are looked up from two dequantization tables with appropriate scaling 
and clamping using the single index supplied by the override. 
Otherwise, the frame-level dequantization factors (as described in 
Section 9.6) are used. In either case, the multiplies are computed 
and stored using 16-bit signed integers. 


The two dequantization tables, which may also be found in the 
reference decoder file dequant data.h (Section 20.3), are as follows. 


--2- Begin code block seo -E-- oo oz omo omo SO 5- (oo 


static const int dc glookup[OINDEX RANGE] < 
4, 5, 6, T 8, 9, 10, 10, lis d2,; "3, da, 15; 
EG 477 l7, 18; -L9,. (20, 205. 21; 2l, 22, 22, 723,5. 23; 
24; 25, -25, :260, <27, 28, 29, 30; 31;. 32;' 329, 34; 35; 
36; 37, 37, 38, 39, 40, 41, 42; 43, 44, 45, 46, 46, 
47,. “AS, 49, -b0;. Di; 52, .B3,. 54; B5b,. DO; -51y DB “59y 
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 
X34 UL CH. des Chop CIT; 8, T9 80, 81, 82, 83, 84, 
85, .086,-'87]4. .88p. 99, 91, 93,.-95, 96, 98, 100, 101, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 143, 145, 148, 151, 154, 157, 

y 


static const int ac glookup[OINDEX RANGE] = 
{ 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
IJ CE8, | 19, 207 21, '22,. 23, 24; 25, 26. 27, 28,4 29, 
30;: 3l, 32; . 33y- 34, 35, “36, 37, :38p..395. 40, 4l, 42, 
43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 119, 122, 125, 128, 131, 134, 137, 
140, 143, 146, 149, 152, 155, 158, 161, 164, 167, 170, 173, 177, 
181, 185, 189, 193, 197, 201, 205, 209, 213, 217, 221, 225, 229, 
234, 239, 245, 249, 254, 259, 264, 269, 274, 279, 284, 
y 


==== End: Code o Korel -sSSS2 SSS SSS sss SSS Ss SSS > ts ore SE 


Lookup values from the above two tables are directly used in the DC 
and AC coefficients in Yl, respectively. For Y2 and chroma, values 
from the above tables undergo either scaling or clamping before the 
multiplies. Details regarding these scaling and clamping processes 
can be found in related lookup functions in dixie.c (Section 20.4). 
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14 


14. 


2A Inverse Transforms 


If the Y2 residue block exists (i.e., the macroblock luma mode is not 
SPLITMV or B PRED), it is inverted first (using the inverse WHT) and 
the element of the result at row i, column j is used as the Oth 
coefficient of the Y subblock at position (i, j), that is, the Y 
subblock whose index is (i * 4) + j. As discussed in Section 13, if 
the luma mode is B PRED or SPLITMV, the Oth Y coefficients are part 
of the residue signal for the subblocks themselves. 


In either case, the inverse transforms for the sixteen Y subblocks 
and eight chroma subblocks are computed next. All 24 of these 
inversions are independent of each other; their results may (at least 
conceptually) be stored in 24 separate 4x4 arrays. 


As is done by the reference decoder, an implementation may wish to 
represent the prediction and residue buffers as macroblock-sized 
arrays (that is, a 16x16 Y buffer and two 8x8 chroma buffers). 
Regarding the inverse DCT implementation given below, this requires a 
simple adjustment to the address calculation for the resulting 
residue pixels. 


3. Implementation of the WHT Inversion 


As previously discussed (see Sections 2 and 13), for macroblocks 
encoded using prediction modes other than B PRED and SPLITMV, the DC 
values derived from the DCT transform on the 16 Y blocks are 
collected to construct a 25th block of a macroblock (16 Y, 4 U, 4 V 
constitute the 24 blocks). This 25th block is transformed using a 
Walsh-Hadamard transform (WHT). 


The inputs to the inverse WHT (that is, the dequantized 
coefficients), the intermediate "horizontally detransformed" signal, 
and the completely detransformed residue signal are all stored as 
arrays of 16-bit signed integers. 


Following the tradition of specifying bitstream format using the 
decoding process, we specify the inverse WHT in the decoding process 
using the following C-style source code: 
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---- Begin code block 


void vp8 short inv walsh4x4 c(short *input, 


{ 
int i; 
int al, 
int a2, 


int templ, 


bl, 
b2, 
short *ip = 
short *op 
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GT 
c2, 


dl; 
a2; 


input; 
output; 
temp2; 


for (i=0;1<4;1++) 


{ 
al = 
bl = 
cl 
dl = 


op [0] 
op [4] 
op [8] 


ip[0] 
ip[4] 
ip[4] 
ip[0] 


= al 


op[12]= 


iptt; 
optt; 
} 


cl 
al 
dl 


ip = output; 
op = output; 
for (i1=0;1<4; i++) 


{ 
al = 
bl 
cl 
dl = 


a2 = 
b2 = 
c2 = 
d2 = 


Bankoski, 


ip[0] 
ip[1] 
ip[1] 
ip[0] 


al 
cl 
al 
di 


+ 


et al. 


+ 
+ 


+ + 


bl; 
dl; 
bl; 
el; 


ip[12]; 
ipl8]; 
ipl8]; 
ip[12]; 


b1; 
dl; 
bl; 
el; 


ipl3]; 
ip[2]; 
ip[2]; 
ip[3]; 


Informational 


short *output) 
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op[0] = (a2+3)>>3; 
op[1] = (b2+3)>>3; 
op[2] = (c2+3)>>3; 
op[3] = (d2+3)>>3; 
ip+=4; 
op+=4; 


} 
cececJgEnd-eode block === 1922992298284 


In the case that there is only one non-zero DC value in input, the 
inverse transform can be simplified to the following: 


===> Begin code block -2—----2——-5--e249—-5-e45-o02-f4X—95--5— 


void vp8 short inv walsh4x4 1 c(short "input, short *output) 
{ 

int i; 

int al; 

short *op-output; 


al = ((input[0] + 3)>>3); 


for (i=0;1<4; i++) 
{ 


op[0] = a1; 
op[1] = al; 
op[2] < a1; 
op[3] < al; 
op+=4; 


} 

tes End code block Ss Sacha 252522229 9H HSH > 

It should be noted that a conforming decoder should implement the 
inverse transform using exactly the same rounding to achieve bit-wise 


matching output to the output of the process specified by the above 
C source code. 


The reference decoder WHT inversion may be found in the file 
idct add.c (Section 20.8). 
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14.4. Implementation of the DCT Inversion 


All of the DCT inversions are computed in exactly the same way. In 
principle, VP8 uses a classical 2-D inverse discrete cosine 
transform, implemented as two passes of 1-D inverse DCT. The 1-D 
inverse DCT was calculated using a similar algorithm to what was 
described in [Loeffler]. However, the paper only provided the 
8-point and 16-point version of the algorithms, which was adapted by 
On2 to perform the 4-point 1-D DCT. 


Accurate calculation of 1-D DCT of the above algorithm requires 
infinite precision.  VP8 of course can use only a finite-precision 
approximation. Also, the inverse DCT used by VP8 takes care of 
normalization of the standard unitary transform; that is, every 
dequantized coefficient has roughly double the size of the 
corresponding unitary coefficient. However, at all but the highest 
datarates, the discrepancy between transmitted and ideal coefficients 
is due almost entirely to (lossy) compression and not to errors 
induced by finite-precision arithmetic. 


The inputs to the inverse DCT (that is, the dequantized 
coefficients), the intermediate "horizontally detransformed" signal, 
and the completely detransformed residue signal are all stored as 
arrays of 16-bit signed integers. The details of the computation are 
as follows. 


It should also be noted that this implementation makes use of the 
16-bit fixed-point version of two multiplication constants: 


sqrt(2) * cos (pi/8) 
sqrt(2) * sin (pi/8) 


Because the first constant is bigger than 1, to maintain the same 
16-bit fixed-point precision as the second one, we make use of the 


fact that 

x * a = x + x* (a-1) 

therefore 

x * sqrt(2) * cos (pi/8) < x * x * (sqrt(2) * cos(pi/8)-1) 
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---- Begin code block 


/* IDCT implementation */ 
static const int cospi8sqrt2minus1-20091; 


mber 2011 


static const int sinpi8sqrt2 
void short idct4x4llm c(short *input, 


( 
int- 1; 
int al, bl, cl, dl; 
short *ip-input; 
short *op-output; 
int templ, temp2; 
int shortpitch 


for (i=0;i<4;i++) 
{ 
al = 
b1 


ip[0]+ip[8]; 
= ip[0]-ip[8]; 
templ = 
temp2 
cl 


(ip[4] 


templ - temp2; 
templ - 
temp2 
al 


ip[4] + 
(ip [12] 
templ + temp2; 


op[shortpitch*0] 
op[shortpitch*3] 
op[shortpitch*1] 
op[shortpitch*2] 


iptt; 
optt; 
} 
ip = output; 
op = output; 
for (i=0;1<4; i++) 
{ 


al = ip[0]+ip[21; 
bl = ipl0]-ipl2]; 
templ = (ip[1] 


temp2 = ip[31+((ipl 


cl = templ - temp2; 
templ = ip[1] + 
Bankoski, et al. 


( (ip[4] 
* sinpi8sqrt2)>>16; 


((ip[1] 


=35468; 


short *output, int pitch) 


pitch>>1; 


* sinpi8sqrt2) >>16; 
ip[12]+((ip[12] 


* cospi8sgrtž2minusl)>>16); 


* cospi8sqrt2minusl1)>>16); 


altdl; 
al-dl; 
bl+c1; 
bl-cl; 


* sinpi8sqrt2)>>16; 


3] * cospi8sgrt2minusl)>>16); 


* cospi8sqrt2minus1)>>16); 
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14. 


temp2 = (ip[3] * sinpi8sgrt2)>>16; 
dl = templ + temp2; 


op[0] (altdi14)>>3; 
op[3] = (al-d1+4)>>3; 
op[1] = (bl*cl*4)»53; 
op[2] = (bl-cl14)>>3; 


ip+=shortpitch; 
op+=shortpitch; 


} 
+=2>"Bnd code: block, === 5222522295922 25925 


The reference decoder DCT inversion may be found in the file 
idct_add.c (Section 20.8). 


5. Summation of Predictor and Residue 


Finally, the prediction and residue signals are summed to form the 
reconstructed macroblock, which, except for loop filtering (taken up 
next), completes the decoding process. 


The summing procedure is fairly straightforward, having only a couple 
of details. The prediction and residue buffers are both arrays of 
16-bit signed integers. Each individual (Y, U, and V pixel) result 
is calculated first as a 32-bit sum of the prediction and residue, 
and is then saturated to 8-bit unsigned range (using, say, the 
clamp255 function defined above) before being stored as an 8-bit 
unsigned pixel value. 


VP8 also supports a mode where the encoding of a bitstream guarantees 
all reconstructed pixel values between 0 and 255; compliant 
bitstreams of such requirements have the clamp_type bit in the frame 
header set to 1. In such a case, the clamp255 function is no longer 
required. 


The summation process is the same, regardless of the (intra or inter) 
mode of prediction in effect for the macroblock. The reference 
decoder implementation of reconstruction may be found in the file 
idct add.c. 
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15. Loop Filter 


Loop filtering is the last stage of frame reconstruction and the 
next-to-last stage of the decoding process. The loop filter is 
applied to the entire frame after the summation of predictor and 
residue signals, as described in Section 14. 


The purpose of the loop filter is to eliminate (or at least reduce) 
visually objectionable artifacts associated with the semi- 
independence of the coding of macroblocks and their constituent 
subblocks. 


As was discussed in Section 5, the loop filter is "integral" to 
decoding, in that the results of loop filtering are used in the 
prediction of subsequent frames. Consequently, a functional decoder 
implementation must perform loop filtering exactly as described here. 
This is distinct from any postprocessing that may be applied only to 
the image immediately before display; such postprocessing is entirely 
at the option of the implementor (and/or user) and has no effect on 
decoding per se. 


The baseline frame-level parameters controlling the loop filter are 
defined in the frame header (Section 9.4) along with a mechanism for 
adjustment based on a macroblock’s prediction mode and/or reference 
frame. The first is a flag (filter_type) selecting the type of 
filter (normal or simple); the other two are numbers 

(loop filter level and sharpness_level) that adjust the strength or 
sensitivity of the filter. As described in Sections 9.3 and 10, 
loop filter level may also be overridden on a per-macroblock basis 
using segmentation. 


Loop filtering is one of the more computationally intensive aspects 
of VP8 decoding. This is the reason for the existence of the 
optional, less-demanding simple filter type. 


Note carefully that loop filtering must be skipped entirely if 

loop filter level at either the frame header level or macroblock 
override level is 0. In no case should the loop filter be run with a 
value of 0; it should instead be skipped. 


We begin by discussing the aspects of loop filtering that are 
independent of the controlling parameters and type of filter chosen. 
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15. 


1. Filter Geometry and Overall Procedure 
The Y, U, and V planes are processed independently and identically. 


The loop filter acts on the edges between adjacent macroblocks and on 
the edges between adjacent subblocks of a macroblock. All such edges 
are horizontal or vertical. For each pixel position on an edge, a 
small number (two or three) of pixels adjacent to either side of the 
position are examined and possibly modified. The displacements of 
these pixels are at a right angle to the edge orientation; that is, 
for a horizontal edge, we treat the pixels immediately above and 
below the edge position, and for a vertical edge, we treat the pixels 
immediately to the left and right of the edge. 


We call this collection of pixels associated to an edge position a 
segment; the length of a segment is 2, 4, 6, or 8.  Excepting that 
the normal filter uses slightly different algorithms for, and either 
filter may apply different control parameters to, the edges between 
macroblocks and those between subblocks, the treatment of edges is 
quite uniform: All segments straddling an edge are treated 
identically; there is no distinction between the treatment of 
horizontal and vertical edges, whether between macroblocks or between 
subblocks. 


As a consequence, adjacent subblock edges within a macroblock may be 
concatenated and processed in their entirety. There is a single 
8-pixel-long vertical edge horizontally centered in each of the U and 
V blocks (the concatenation of upper and lower 4-pixel edges between 
chroma subblocks), and three 16-pixel-long vertical edges at 
horizontal positions 1/4, 1/2, and 3/4 the width of the luma 
macroblock, each representing the concatenation of four 4-pixel 
sub-edges between pairs of Y subblocks. 


The macroblocks comprising the frame are processed in the usual 
raster-scan order. Each macroblock is "responsible for" the 
inter-macroblock edges immediately above and to the left of it (but 
not the edges below and to the right of it), as well as the edges 
between its subblocks. 


For each macroblock M, there are four filtering steps, which are, 
(almost) in order: 


T If M is not on the leftmost column of macroblocks, filter across 
the left (vertical) inter-macroblock edge of M. 


2. Filter across the vertical subblock edges within M. 
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3. If M is not on the topmost row of macroblocks, filter across the 
top (horizontal) inter-macroblock edge of M. 


4. Filter across the horizontal subblock edges within M. 


We write MY, MU, and MV for the planar constituents of M, that is, 
the 16x16 luma block, 8x8 U block, and 8x8 V block comprising M. 


In step 1, for each of the three blocks MY, MU, and MV, we filter 
each of the (16 luma or 8 chroma) segments straddling the column 
Separating the block from the block immediately to the left of it, 
using the inter-macroblock filter and controls associated to the 
loop filter level and sharpness level. 


In step 4, we filter across the (three luma and one each for U and V) 
vertical subblock edges described above, this time using the 
inter-subblock filter and controls. 


Steps 2 and 4 are skipped for macroblocks that satisfy both of the 
following two conditions: 


1. Macroblock coding mode is neither B PRED nor SPLITMV; and 
2. There is no DCT coefficient coded for the whole macroblock. 


For these macroblocks, loop filtering for edges between subblocks 
internal to a macroblock is effectively skipped. This skip strategy 
significantly reduces VP8 loop-filtering complexity. 


Edges between macroblocks and those between subblocks are treated 
with different control parameters (and, in the case of the normal 
filter, with different algorithms). Except for pixel addressing, 
there is no distinction between the treatment of vertical and 
horizontal edges. Luma edges are always 16 pixels long, chroma edges 
are always 8 pixels long, and the segments straddling an edge are 
treated identically; this of course facilitates vector processing. 


Because many pixels belong to segments straddling two or more edges, 
and so will be filtered more than once, the order in which edges are 
processed given above must be respected by any implementation. 
Within a single edge, however, the segments straddling that edge are 
disjoint, and the order in which these segments are processed is 
immaterial. 


Before taking up the filtering algorithms themselves, we should 
emphasize a point already made: Even though the pixel segments 
associated to a macroblock are antecedent to the macroblock (that is, 
lie within the macroblock or in already-constructed macroblocks), a 
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macroblock must not be filtered immediately after its 


"reconstruction" (described in Section 14). Rather, the loop filter 
applies after all the macroblocks have been "reconstructed" (i.e., 
had their predictor summed with their residue); correct decoding is 


predicated on the fact that already-constructed portions of the 
current frame referenced via intra-prediction (described in 
Section 12) are not yet filtered. 


15.2. Simple Filter 


Having described the overall procedure of, and pixels affected by, 
the loop filter, we turn our attention to the treatment of individual 
segments straddling edges. We begin by describing the simple filter, 
which, as the reader might guess, is somewhat simpler than the normal 
filter. 


Note that the simple filter only applies to luma edges.  Chroma edges 
are left unfiltered. 


Roughly speaking, the idea of loop filtering is, within limits, to 
reduce the difference between pixels straddling an edge. Differences 
in excess of a threshold (associated to the loop filter level) are 
assumed to be "natural" and are unmodified; differences below the 
threshold are assumed to be artifacts of quantization and the 
(partially) separate coding of blocks, and are reduced via the 
procedures described below. While the loop filter level is in 
principle arbitrary, the levels chosen by a VP8 compressor tend to be 
correlated to quantizer levels. 


Most of the filtering arithmetic is done using 8-bit signed operands 
(having a range of -128 to +127, inclusive), supplemented by 16-bit 


temporaries holding results of multiplies. 


Sums and other temporaries need to be "clamped" to a valid signed 
8-bit range: 


===- Begin code block --------2---5--------ec-2e5o-ocec--oscoe- 
int8 c(int v) 
{ 
return (int8) (v < -128 ? -128 : (v < 128 ? v : 127)); 
} 


ssccuEnd.oode block, SEE SESEŠE ESS 29 SS SE 
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Since pixel values themselves are unsigned 8-bit numbers, we need to 
convert between signed and unsigned values: 


šče; Begin code block) 2223533522252 Saas aS Sasa 


/* Convert pixel value (0 <= v <= 255) to an 8-bit signed 
number. */ 
int8 u2s(Pixel v) { return (int8) (v - 128);} 


/* Clamp, then convert signed number back to pixel value. */ 
Pixel sžu(int v) ( return (Pixel) (c(v) + 128);) 


===> nd eode block. Sass SS TEO 5-5 ae SS Se o x. (iII?! 


Filtering is often predicated on absolute-value thresholds. The 
following function is the eguivalent of the standard library function 
abs, whose prototype is found in the standard header file stdlib.h. 
For us, the argument v is always the difference between two pixels 
and lies in the range -255 << v <= +255. 


oe Begin code bloki ET TET prs omo A opos 
int abs(int v) { return v < 0? -v : v;} 
sz End code block seo mo +92 +22 2442 


An actual implementation would of course use inline functions or 
macros to accomplish these trivial procedures (which are used by both 
the normal and simple loop filters). An optimal implementation would 
probably express them in machine language, perhaps using single 
instruction, multiple data (SIMD) vector instructions. On many SIMD 
processors, the saturation accomplished by the above clamping 
function is often folded into the arithmetic instructions themselves, 
obviating the explicit step taken here. 


To simplify the specification of relative pixel positions, we use the 
word "before" to mean "immediately above" (for a vertical segment 
straddling a horizontal edge) or "immediately to the left of" (for a 
horizontal segment straddling a vertical edge), and the word "after" 
to mean "immediately below" or "immediately to the right of". 


Given an edge, a segment, and a limit value, the simple loop filter 
computes a value based on the four pixels that straddle the edge (two 


either side). If that value is below a supplied limit, then, very 
roughly speaking, the two pixel values are brought closer to each 
other, "shaving off" something like a guarter of the difference. The 
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same procedure is used for all segments straddling any type of edge, 
regardless of the nature (inter-macroblock, inter-subblock, luma, or 
chroma) of the edge; only the limit value depends on the edge type. 


The exact procedure (for a single segment) is as follows; the 
subroutine common adjust is used by both the simple filter presented 
here and the normal filters discussed in Section 15.3. 


==> Begin code bločk =======23=5==5===5===5=23=3=35==59==5=3=3=5 


int8 common adjust( 


int use outer taps, /* filter is 2 or 4 taps wide */ 
const Pixel "Pl, /* pixel before PO */ 
Pixel "PO, /* pixel before edge */ 
Pixel *QO, /* pixel after edge */ 
const Pixel *Q1 /* pixel after 00 */ 
) 4 
cint8 pl < u2s(*P1); /* retrieve and convert all 4 pixels %/ 
cint8 p0 = u2s(*P0); 
cint8 g0 = u2s(*Q0); 
cint8 gl = u2s(*01); 


/* Disregarding clamping, when "use_outer_taps" is false, 
"a" is 3*(q0-p0). Since we are about to divide "a" by 
8, in this case we end up multiplying the edge 
difference by 5/8. 
When "use outer taps" is true (as for the simple filter), 
"a" is pl - 3*p0 + 3*q0 - gl, which can be thought of as 
a refinement of 2*(q0 - p0), and the adjustment is 
something like (q0 - p0)/4. */ 

int8 a = c((use outer taps? c(pl - gl) : 0) + 3* (q0 - p0)); 


/* b is used to balance the rounding of a/8 in the case where 
the "fractional" part "f" of a/8 is exactly 1/2. */ 


cint8 b = (c(a + 3)) >> 3; 
/* Divide a by 8, rounding up when f »- 1/2. 

Although not strictly part of the C language, 

the right shift is assumed to propagate the sign bit. */ 
a= cla + 4) >> 3; 


/* Subtract "a" from q0, "bringing it closer" to pO. */ 


*00 = s2u(q0 - a); 
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/* Add "a" (with adjustment "b") to p0, "bringing it closer" 
to qO. 


The clamp of "atb", while present in the reference decoder, 
is superfluous; we have -16 << a << 15 at this point. */ 


*PO = s2u(p0 + 5b); 


return a; 


} 

eS 75nd. code block '==S2=22=25=53 A Sasser 
sc Begin code block === 25292222222 
void simple segment( 


uint8 edge limit, /* do nothing if edge difference 
exceeds limit */ 


const Pixel "Pl, /* pixel before PO */ 
Pixel "PO, /* pixel before edge */ 
Pixel *QO, /* pixel after edge */ 
const Pixel *Q1 /* pixel after 00 */ 


if ((abs(*PO - *Q0)*2 + abs(*P1 - *Q1)/2) <= edge limit)) 
common adjust(1, P1, PO, 00, 01); /* use outer taps */ 
} 


=== Eng codes block: 5s-----5 5-5 55-306 Noo?-o "to USII' 


We make a couple of remarks about the rounding procedure above. When 
b is zero (that is, when the "fractional part" of a is not 1/2), we 
are (except for clamping) adding the same number to p0 as we are 
subtracting from q0. This preserves the average value of p0 and go, 
but the resulting difference between pO and q0 is always even; in 
particular, the smallest non-zero gradation +-1 is not possible here. 


When b is one, the value we add to p0 (again except for clamping) is 
one less than the value we are subtracting from q0. In this case, 
the resulting difference is always odd (and the small gradation +-1 
is possible), but the average value is reduced by 1/2, yielding, for 
instance, a very slight darkening in the luma plane. (In the very 
unlikely event of appreciable darkening after a large number of 
interframes, a compressor would of course eventually compensate for 
this in the selection of predictor and/or residue.) 
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l5; 


The derivation of the edge limit value used above, which depends on 
the loop filter level and sharpness level, as well as the type of 
edge being processed, will be taken up after we describe the normal 
loop filtering algorithm below. 


3. Normal Filter 


The normal loop filter is a refinement of the simple loop filter; all 
of the general discussion above applies here as well. In particular, 
the functions c, u2s, s2u, abs, and common adjust are used by both 
the normal and simple filters. 


As mentioned above, the normal algorithms for inter-macroblock and 
inter-subblock edges differ.  Nonetheless, they have a great deal in 
common: They use similar threshold algorithms to disable the filter 
and to detect high internal edge variance (which influences the 
filtering algorithm). Both algorithms also use, at least 
conditionally, the simple filter adjustment procedure described 
above. 


The common thresholding algorithms are as follows. 
--s-— Begin code block -s-eo remo 5 5-5 Eo -I- OP xP O ost 


/* All functions take (among other things) a segment (of length 
at most 4 + 4 = 8) symmetrically straddling an edge. 


The pixel values (or pointers) are always given in order, 
from the "beforemost" to the "aftermost". So, fora 
horizontal edge (written ws an 8-pixel segment would be 
ordered p3 p2 pl pod | g0 gl g2 q3. */ 


/* Filtering is disabled if the difference between any two 
adjacent "interior" pixels in the 8-pixel segment exceeds 
the relevant threshold (I). A more complex thresholding 
calculation is done for the group of four pixels that 
straddle the edge, in line with the calculation in 
simple segment() above. %/ 
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int filter yes( 
uint8 I, /* limit on interior differences */ 
uint8 E, /* limit at the edge */ 


cint8 p3, cint8 p2, cint8 pl, cint8 p0, /* pixels before 
edge */ 

cint8 q0, cint8 gl, cint8 g2, cint8 q3 /* pixels after 
edge */ 


return  (abs(p0 - q0)*2 + abs(pl - gl)/2) <= E 
&& abs(p3 - p2) << I && abs(p2 - pl) <= I 88 
abs(pl - p0) << I 
&& abs(q3 - q2) << I && abs(q2 - gl) << I 68 
abs(gl - q0) << I; 


} 
Sot oje EOS AD LO Uk a r OO Oa 
šče Begin code block RiR Io ooo EDI O A o ote 


/* Filtering is altered if (at least) one of the differences 
on either side of the edge exceeds a threshold (we have 
"high edge variance"). */ 


int hev( 

uint8 threshold, 

cint8 pl, cint8 p0, /* pixels before edge */ 

cint8 q0, cint8 gl /* pixels after edge */ 
) 4 

return abs(pl - p0) » threshold | | abs(gl - g0) > threshold; 
} 


ss-c-uEndacocode block ====2535====p=995== 1995 === PSSS IP == 


The subblock filter is a variant of the simple filter. In fact, if 
we have high edge variance, the adjustment is exactly as for the 
simple filter. Otherwise, the simple adjustment (without outer taps) 
is applied, and the two pixels one step in from the edge pixels are 
adjusted by roughly half the amount by which the two edge pixels are 
adjusted; since the edge adjustment here is essentially 3/8 the edge 
difference, the inner adjustment is approximately 3/16 the edge 
difference. 


Bankoski, et al. Informational [Page 92] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


===="Begin code, block sze 5-5 5 5-5 +22 +52 


void subblock filter( 

uint8 hev threshold, /* detect high edge variance */ 

uint8 interior limit, /* possibly disable filter */ 

uint8 edge limit, 

cint8 *P3, cint8 *P2, int8 "Pl, int8 "PO, /* pixels before 
edge */ 

int8 "00, int8 "01, cint8 *Q2, cint8 *Q3 /* pixels after 
edge */ 


cint8 p3 = u2s(*P3), p2 = u2s("P2), pl = u2s(*P1), 
pO = u2s(*P0); 

cint8 q0 = u2s(*Q0), gl 
q3 = u2s(*03); 


u2s(*Q1), q2 u2s(*Q2), 


if (filter yes(interior limit, edge limit, q3, q2, gl, go, 
p0, pl, p2, p3)) 
{ 
const int hv = hev(hev_threshold, pl, p0, g0, gl); 


cint8 a = (common adjust(hv, P1, PO, 00, Q1) + 1) >> 1; 


if (!hv) 4 
*Ol = s2u(ql - a); 
*P1 = s2u(pl + a); 


} 
SSS End Code: block SSsS sess o oro oo 5-5 DS OE SSeS SSSSSs5S555 


The inter-macroblock filter has potentially wider scope. If the edge 
variance is high, it performs the simple adjustment (using the outer 
taps, just like the simple filter and the corresponding case of the 
normal subblock filter). If the edge variance is low, we begin with 
the same basic filter calculation and apply multiples of it to pixel 
pairs symmetric about the edge; the magnitude of adjustment decays as 
we move away from the edge and six of the pixels in the segment are 
affected. 
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icc Begin code block sz-e roso o 5 5-5 2222 925222 


void MBfilter ( 

uint8 hev_threshold, /* detect high edge variance */ 

uint8 interior limit, /* possibly disable filter */ 

uint8 edge limit, 

cint8 *P3, int8 *P2, int8 "Pl, int8 "PO,  /% pixels before 
edge */ 

int8 "00, int8 "01, int8 *Q2, cint8 *Q3 /* pixels after 
edge */ 


cint8 p3 = u2s(*P3), p2 = u2s(*P2), pl = u2s(*P1), 
pO = u2s(*P0); 

cint8 q0 = u2s(*Q0), gl 
q3 = u2s(*03); 


u2s(*Q1), q2 u2s(*Q2), 


if (filter yes(interior limit, edge limit, q3, q2, gl, go, 
pO, pl, p2, p3)) 
( 
if (!hev(hev threshold, pl, p0, q0, gl)) 
{ 
/* Same as the initial calculation in "common_adjust", 
w is something like twice the edge difference */ 


const int8 w = c(c(pl - ql) + 3"(g0 - p0)); 

/* 9/64 is approximately 9/63 - 1/7, and 1<<7 < 128 - 
2*64. So this a, used to adjust the pixels adjacent 
to the edge, is something like 3/7 the edge 
difference. */ 

int8 a = c((27*w + 63) >> 7); 

*00 = s2u(q0 - a); *PO = s2u(p0 + a); 

/* Next two are adjusted by 2/7 the edge difference */ 

a = c((18*w + 63) >> 7); 

*Ol = s2u(ql - a); *P1 = s2u(pl + a); 

/* Last two are adjusted by 1/7 the edge difference */ 


a = c((9*w + 63) >> 7); 


*02 = s2u(q2 - a); *P2 = s2u(p2 + a); 
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15. 


) else /* if hev, do simple filter */ 
common adjust(1, P1, PO, 00, 01); /* using outer 
taps */ 


} 
2ccc-uEnd code; DblOCk, =HsSsS5SSSSSHes S55 oases SSS SSeS SSeS sesese 
4. Calculation of Control Parameters 


We conclude the discussion of loop filtering by showing how the 
thresholds supplied to the procedures above are derived from the two 
control parameters sharpness_level (an unsigned 3-bit number having 
maximum value 7) and loop filter level (an unsigned 6-bit number 
having maximum value 63). 


While the sharpness level is constant over the frame, individual 
macroblocks may override the loop filter level with one of four 
possibilities supplied in the frame header (as described in 
Section 10). 


Both the simple and normal filters disable filtering if a value 
derived from the four pixels that straddle the edge (2 either side) 
exceeds a threshold / limit value. 


====.Begin ¿code block <------S-—--—------- 5 -----------— 
/* Luma and Chroma use the same inter-macroblock edge limit */ 


uint8 mbedge limit = ((loop filter level + 2) * 2) + 
interior limit; 


/* Luma and Chroma use the same inter-subblock edge limit */ 
uint8 sub bedge limit = (loop filter level " 2) + interior limit; 


===> na “code block. =======*=S=22=HP 39239 ee aS ae 


The remaining thresholds are used only by the normal filters. The 
filter-disabling interior difference limit is the same for all edges 
(luma, chroma, inter-subblock, inter-macroblock) and is given by the 
following. 
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s--— Begin code, block =========2222=+22325+ 922222925222 
uint8 interior limit - loop filter level; 


if (sharpness level) 


{ 


interior limit >>=  sharpness level > 4? 2: 1; 
if (interior limit > 9 - sharpness level) 
interior limit < 9 - sharpness level; 
} 
if (!interior_limit) 
interior_limit = 1; 


cucÉ End code Dlock sss Ss Io seh isso eS o PO SSeS 29 5 


Finally, we give the derivation of the high edge-variance threshold, 
which is also the same for all edge types. 


235 Begimicode block! Haass SaaS ata ao Eno IMO sass 
uint8 hev_threshold = 0; 


if (we_are_decoding_akey_frame) /* current frame is a key frame */ 
{ 
if (loop_filter_level >= 40) 
hev_threshold = 2; 
else if (loop_filter_level >= 15) 
hev_threshold = 1; 


else /* current frame is an interframe */ 


if (loop_filter_level >= 40) 
hev_threshold = 3; 
else if (loop_filter_level >= 20) 
hev_threshold = 2; 
else if (loop_filter_level >= 15) 
hev_threshold = 1; 
} 


== Bnd code block: === SS o osmo oo mnm So id] sit 
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16.  Interframe Macroblock Prediction Records 


We describe the layout and semantics of the prediction records for 
macroblocks in an interframe. 


After the feature specification (which is described in Section 10 and 
is identical for intraframes and interframes), there comes a 

Bool (prob intra), which indicates inter-prediction (i.e., prediction 
from prior frames) when true and intra-prediction (i.e., prediction 
from already-coded portions of the current frame) when false. The 
zero-probability prob intra is set by field J of the frame header. 


16.1.  Intra-Predicted Macroblocks 
For intra-prediction, the layout of the prediction data is 
essentially the same as the layout for key frames, although the 


contexts used by the decoding process are slightly different. 


As discussed in Section 8, the "outer" Y mode here uses a different 
tree from that used in key frames, repeated here for convenience. 


sec Begin code blOočkUš5ačE5E5E5e 55 E5 a ESS oES So to-- 


const tree index ymode tree [2 * (num ymodes - 1)] < 
{ 
-DC PRED, 2, /* root: DC PRED - "0", "1" subtree */ 
4, 6, /* "1" subtree has 2 descendant subtrees */ 
-V PRED, -H PRED, /* "10" subtree: V PRED = "100", 
H PRED < "101" */ 
—-IM PRED, -B PRED /x "11" subtree: TM PRED < "110", 
B PRED < "111" */ 


n 
a7 os RNG Code. block a Saas ROS a EC 


The probability table used to decode this tree is variable. As 
described in Section 11, it (along with the similarly treated UV 
table) can be updated by field J of the frame header. Similar to the 
coefficient-decoding probabilities, such updates are cumulative and 
affect all ensuing frames until the next key frame or explicit 
update. The default probabilities for the Y and UV tables are: 


zo o Begin code; blOčk, Ss Sasa setae Sas oo BB moe 


Prob ymode prob [num ymodes - 1] < ( 112, 86, 140, 37); 
Prob uv mode prob [num uv modes - 1] < ( 162, 101, 204); 


=== Bnd code block -e oo 5 žo 55- gat PI me ao RS ot? odo 
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These defaults must be restored after detection of a key frame. 


Just as for key frames, if the Y mode is B PRED, there next comes an 
encoding of the intra bpred mode used by each of the sixteen Y 
subblocks. These encodings use the same tree as does that for key 
frames but, in place of the contexts used in key frames, these 
encodings use the single fixed probability table. 


==== Begin code block: ========3==========32=5=233=5==5===5H3=3=5 
const Prob bmode prob [num intra bmodes - 1] = ( 
1205-905 79, 133, 87, 95, 80/ lll, 151 
y; 
—-es- blid, Code Dlock-"=======:==55=899=S99 pS Sasa mo po s? 
Last comes the chroma mode, again coded using the same tree as that 
used for key frames, this time using the dynamic uv mode prob table 


described above. 


The calculation of the intra-prediction buffer is identical to that 
described for key frames in Section 12. 


2.  Inter-Predicted Macroblocks 
Otherwise (when the above bool is true), we are using 


inter-prediction (which of course only happens for interframes), to 
which we now restrict our attention. 


The next datum is then another bool, B(prob last), selecting the 


reference frame. If 0, the reference frame is the previous frame 
(the last frame); if 1, another bool (prob gf) selects the reference 
frame between the golden frame (0) and the altref frame (1). The 


probabilities prob last and prob gf are set in field J of the frame 
header. 


Together with setting the reference frame, the purpose of inter-mode 
decoding is to set a motion vector for each of the sixteen Y 
subblocks of the current macroblock. These settings then define the 
calculation of the inter-prediction buffer (detailed in Section 18). 
While the net effect of inter-mode decoding is straightforward, the 
implementation is somewhat complex; the (lossless) compression 
achieved by this method justifies the complexity. 
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After the reference frame selector comes the mode (or motion vector 
reference) applied to the macroblock as a whole, coded using the 
following enumeration and tree. Setting mv nearest - num ymodes is a 
convenience that allows a single variable to unambiguously hold an 
inter- or intra-prediction mode. 


scc-Beginscode-bloGk o ===0=======s==5=5950===29855==555=5=5 


typedef enum 
{ 


mv_nearest = num_ymodes, /* use "nearest" motion vector 
for entire MB */ 
mv_near, /* use "next nearest" "" */ 
mv zero, /* use zero "" */ 
mv. new, /* use explicit offset from 
implicat t -xy 
mv_split, /* use multiple motion vectors */ 
num_mv_refs = mv_split + 1 - mv_nearest 
} 
mv ref; 
const tree index mv ref tree [2 * (num mv refs - 1)] - 
{ 
-mv zero, 2, /* zero = "0" */ 
-mv nearest, 4, /* nearest < "10" */ 
-mv near, 6, /* near = "110" */ 
-mv new, -mv split /* new < "1110", split - "1111" */ 


y; 

===> “Eng. code block  ========= === === 525 =====39 === == 

3. Mode and Motion Vector Contexts 

The probability table used to decode the mv_ref, along with three 
reference motion vectors used by the selected mode, is calculated via 
a survey of the already-decoded motion vectors in (up to) 3 nearby 


macroblocks. 


The algorithm generates a sorted list of distinct motion vectors 


adjacent to the search site. The best_mv is the vector with the 
highest score. The mv_nearest is the non-zero vector with the 
highest score. The mv_near is the non-zero vector with the next 
highest score. The number of motion vectors coded using the SPLITMV 


mode is scored using the same weighting and is returned with the 
Scores of the best, nearest, and near vectors. 
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The three adjacent macroblocks above, left, and above-left are 
considered in order. If the macroblock is intra-coded, no action is 
taken. Otherwise, the motion vector is compared to other previously 
found motion vectors to determine if it has been seen before, and if 
SO contributes its weight to that vector; otherwise, it enters a new 
vector in the list. The above and left vectors have twice the weight 
of the above-left vector. 


As is the case with many contexts used by VP8, it is possible for 
macroblocks near the top or left edges of the image to reference 
blocks that are outside the visible image.  VP8 provides a border of 
1 macroblock filled with 0x0 motion vectors left of the left edge, 
and a border filled with 0,0 motion vectors of 1 macroblocks above 
the top edge. 


Much of the process is more easily described in C than in English. 
The reference code for this can be found in modemv.c (Section 20.11). 
The calculation of reference vectors, probability table, and, 
finally, the inter-prediction mode itself is implemented as follows. 


re Begin code block; === 22293722291 SÓ 


typedef union 
{ 
unsigned int as_int; 
MV as_mv; 
) int mv; /* facilitates rapid equality tests */ 


static void mv bias(MODE INFO *x,int refframe, int mv *mvp, 
int * ref frame sign bias) 


{ 


MV xmv; 
xmv = x->mbmi.mv.as mv; 
if ( ref frame sign bias[x-»mbmi.ref frame] !- 


ref frame sign bias[refframe] ) 


{ 
xmv.row*--1; 
xmv.col*--1; 
} 
mvp-»as mv = xmv; 


} 


scccuEndooode DlOCk ==325 5255252929229 2 E OSE tole 
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==== Begin code block sze o 5-- o ooo 5 922222 +2 222 


void vp8 clamp mv(MV "mv, const MACROBLOCKD *xd) 
if ( mv->col < (xd->mb to left edge - LEFT TOP MARGIN) ) 
mv->col - xd->mb to left edge - LEFT TOP MARGIN; 
else if ( mv->col > xd->mb to right edge t RIGHT BOTTOM MARGIN ) 
mv->col < xd->mb to right edge t RIGHT BOTTOM MARGIN; 


if ( mv->row < (xd->mb to top edge - LEFT TOP MARGIN) ) 
mv->row < xd->mb to top edge - LEFT TOP MARGIN; 
else if ( mv->row > xd->mb to bottom edge + RIGHT BOTTOM MARGIN ) 
mv->row = xd->mb to bottom edge + RIGHT BOTTOM MARGIN; 
} 


čo - End code block SaaS akanSes sss SSS ras See SS ROSES 


In the function vp8_find_near_mvs(), the vectors "nearest" and "near" 
are used by the corresponding modes. 


The vector best_mv is used as a base for explicitly coded motion 
vectors. 


The first three entries in the return value cnt are (in order) 
weighted census values for "zero", "nearest", and "near" vectors. 

The final value indicates the extent to which SPLITMV was used by the 
neighboring macroblocks. The largest possible "weight" value in each 
case is 5. 


čer Begin code block: Aisti nnn a A nA ee Sessa 


void vp8 find near mvs 
( 
MACROBLOCKD "xd, 
const MODE INFO *here, 
MV *nearest, 
MV *near, 
MV *best mv, 
int cnt[4], 
int refframe, 
int * ref frame sign bias 
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const MODE INFO "above - here - xd->mode info stride; 
const MODE INFO *left - here - 1; 
const MODE INFO *aboveleft = above - 1; 


int mv near mvs[4]; 
int mv *mv — near mvs; 
int *cntx = cnt; 


enum (CNT ZERO, CNT NEAREST, CNT NEAR, CNT SPLITMV); 
/* Zero accumulators */ 
mv[0].as int < mv[1].as int < mv[2].as int - 0; 


cnt[0] < cnt[1] < cnt[2] < cnt[3] < 0; 


/* Process above */ 


if (above->mbmi.ref frame != INTRA FRAME) 1 
if (above->mbmi.mv.as int) 1 
(++mv)->as_int = above->mbmi.mv.as int; 
mv bias(above, refframe, mv, ref frame sign bias); 
++cntx; 


} 
*cntx += 2; 


} 


/* Process left */ 
if (left->mbmi.ref frame != INTRA FRAME) { 
if (left->mbmi.mv.as int) 14 
int mv this mv; 


this mv.as int = left->mbmi.mv.as int; 


mv bias(left, refframe, &this mv, ref frame sign bias); 


if (this mv.as int != mv-»as int) { 
(++mv)->as_int = this mv.as int; 
++cntx; 


} 
*cntx += 2; 
} else 
cnt[CNT ZERO] += 2; 
} 


/* Process above left */ 
if (aboveleft->mbmi.ref frame != INTRA FRAME) { 
if (aboveleft->mbmi.mv.as int) 1 
int mv this mv; 


this mv.as int - aboveleft->mbmi.mv.as int; 
mv bias(aboveleft, refframe, &this mv, 
ref frame sign bias); 
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if (this mv.as int != mv->as int) { 
(--mv)-»as int = this mv.as int; 
++cntx; 
) 
*cntx += 1; 
) else 
cnt[CNT ZERO] += 1; 
} 
/* If we have three distinct MVs ... */ 
if (cnt[CNT SPLITMV]) 4 
/* See if above-left MV can be merged with NEAREST */ 
if (mv->as int == near mvs[CNT NEAREST].as int) 
cnt[CNT NEAREST] += 1; 
} 
cnt [CNT_SPLITMV] = ((above->mbmi.mode == SPLITMV) 
+ (left->mbmi.mode == SPLITMV)) * 2 
+ (aboveleft->mbmi.mode == SPLITMV); 


/* Swap near and nearest if necessary */ 
(cnt[CNT NEAR] > cnt [CNT_NEAREST]) 4 


if 


} 
/* 


if 


int tmp; 

tmp = cnt[CNT NEAREST]; 

cnt[CNT NEAREST] = cnt[CNT NEAR]; 
cnt[CNT NEAR] = tmp; 

tmp = near mvs[CNT NEAREST].as int; 


near mvs[CNT NEAREST].as int < near mvs[CNT NEAR].as int; 


near mvs[CNT NEAR].as int - tmp; 


Use near mvs[0] to store the "best" MV */ 


(cnt [CNT NEAREST] >= cnt[CNT ZERO]) 
near mvs[CNT ZERO] -» near mvs[CNT NEAREST]; 


/* Set up return values */ 


*best mv - 


*nearest 
*near = near mvs[CNT NEAR].as mv; 


near mvs[0].as mv; 
= near mvs[CNT NEAREST].as mv; 


vp8 clamp mv(nearest, xd); 
vp8 clamp mv(near, xd); 
vp8 . clamp mv (best. mv, xd); //TODO: Move this up before 


} 


the copy 


+= End code block === Soho essere IS see e as 
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The mv ref probability table (mv ref p) is then derived from the 
census as follows. 
šče Begin code block: ===235>235+222 2222922235222 HSE2 


const int vp8_mode_contexts[6][4] = 


{ 


{ 7, 1, 1, 143, ), 
( 14, 18, 14, 107, Y 
( 135, 64, 57, 68, y, 
( 60, 56, 128, 65, i 
( 159, 134, 128, 34, k, 
( 234, 188, 128, 28, js 


} 

===="End code block '====3:2==H22329= Soa nae SS Sas 
===) Begin. code bločkšr5Emi:----S o I5 EPS Saas SSeS Raa Se do- 
vp8_prob *vp8_mv_ref_probs (vp8_prob mv_ref_p[VP8_MVREFS-1], 


int cnt[4]) 
{ 


mv_ref_p[0] = vp8_mode_contexts [cnt[0]] [0]; 
mv_ref_p[1] = vp8_mode_contexts [cnt[1]] [1]; 
mv ref p[2] = vp8 mode contexts [cnt[2]] [2]; 
mv ref p[3] = vp8 mode contexts [cnt[3]] [3]; 


return p; 


} 
Soon) Ena ¡code i DLO CK SSeS SaaS a Se Sse 
Once mv_ref_p is established, the mv_ref is decoded as usual. 
3533 Begin code DLOCK? Sse Sa r AA Sa IE TO Sasss 
mvr = (mv_ref) treed_read(d, mv_ref_tree, mv_ref_p); 
sč-- End code block === =+H293293 97 HS o Mo ss?! ss?! 
For the first four inter-coding modes, the same motion vector is used 


for all the Y subblocks. The first three modes use an implicit 
motion vector. 
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D E FSSD ASS a Se ss ae Ss Ses esse Se + 
| Mode | Instruction 
Tutte Fase E EL E EE t 
| mv nearest | Use the nearest vector returned by 
| | vp8. find near mvs. 
| mv_near | Use the near vector returned by vp8_find_near_mvs. | 
| 
mv_zero | Use a zero vector; that is, predict the current | 
| macroblock from the corresponding macroblock in the 
| | prediction frame. | 
| NEWMV | This mode is followed by an explicitly coded motion | 
| | vector (the format of which is described in the next | 
| | section) that is added (component-wise) to the | 
best_mv reference vector returned by find_near_mvs 
and applied to all 16 subblocks. 
4------------ q-—---------------------------------------------------- t 


16.4. Split Prediction 


The remaining mode (SPLITMV) causes multiple vectors to be applied to 
the Y subblocks. It is immediately followed by a partition 
Specification that determines how many vectors will be specified and 
how they will be assigned to the subblocks. The possible partitions, 
with indicated subdivisions and coding tree, are as follows. 


====""Begán code. block SSS sss — 555-555 see Bm popiti 


typedef enum 
mv top bottom, /* two pieces (0...7) and (8...15) */ 
mv left right, /* (0,1,4,5,8,9,12,13) and 
{2,3,6,7,10,11,14,15} %/ 


mv quarters, /* {0,1,4,5}, {2,3,6,7}, {8,9,12,13}, 
{10,11,14,15} */ 

MV_16, /* every subblock gets its own vector 
103-5) El 


mv num partitions 


) 
MVpartition; 
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const tree index mvpartition tree [2 * (mvnum partition - 1)] < 
{ 
-MV 16, 2, /* MV 16 < "0" */ 
-mv quarters, 4, /* mv quarters < "10" */ 
-mv top bottom, -mv left right /* top bottom - "110", 
left right - "111" */ 


y; 
===> ¿Eng Code: block ===5239==== SSS SSS SSS 52 SS SSS se PSSS ESSE 
The partition is decoded using a fixed, constant probability table: 


====""Begán Code. block, ======2=========2===2=2====HpH=322===5 


const Prob mvpartition_probs [mvnum_partition - 1] = 
{ 110, 111, 150}; 
part = (MVpartition) treed_read(d, mvpartition_tree, 


mvpartition probs); 
zco-ccodEndscode block .-====2 == +29 SS O O SS SS SA 


After the partition come two (for mv top bottom or mv left right), 
four (for mv quarters), or sixteen (for MV 16) subblock 


inter-prediction modes. These modes occur in the order indicated by 
the partition layouts (given as comments to the MVpartition enum) and 
are coded as follows. (As was done for the macroblock-level modes, 


we offset the mode enumeration so that a single variable may 
unambiguously hold either an intra- or inter-subblock mode.) 


Prior to decoding each subblock, a decoding tree context is chosen as 
illustrated in the code snippet below. The context is based on the 
immediate left and above subblock neighbors, and whether they are 
equal, are zero, or a combination of those. 


----— Begin code. block .+====2===2=======232=2==2=S====p2=H2===5 
typedef enum 


{ 


LEFT4x4 = num_intra_bmodes, /* use already-coded MV to 


my left */ 
ABOVE4x4, /* use already-coded MV above me */ 
ZERO4x4, /* use zero MV */ 
NEW4x4, /* explicit offset from "best" */ 


num sub mv ref 
y; 


sub_mv_ref; 
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const tree index sub mv ref tree [2 * (num sub mv ref - 1)] < 
{ 
-LEFTAXA, 2, /* LEFT = "0" */ 

-ABOVE4X4, 4, /* ABOVE = "10" */ 

—-ZERO4X4, -NEWAXA /* ZERO < "110", NEW < "111" */ 


y; 


/* Choose correct decoding tree context 

* Function parameters are left subblock neighbor MV and above 
* subblock neighbor MV */ 

int vp8_mvCont (MV *1, MV*a) 

{ 


int lez = (l->row == 0 && l->col << 0); /* left neighbor 
is zero */ 

int aez = (a->row == 0 && a->col == 0); /* above neighbor 
is zero */ 

int lea = (l->row == a->row && l->col == a->col); /* left 


neighbor equals above neighbor */ 


if (lea && lez) 
return SUBMVREF_LEFT_ABOVE_ZED; /* =4 */ 


if (lea) 
return SUBMVREF_LEFT_ABOVE_SAME; /* =3 */ 


if (aez) 
return SUBMVREF_ABOVE_ZED; /* =2 */ 


if (lez) 
return SUBMVREF_LEFT_ZED; /* -1*/ 


return SUBMVREF NORMAL; /* <0 */ 
) 


/* Constant probabilities and decoding procedure. */ 


const Prob sub mv ref prob [5] [num sub mv ref - 1] = { 
147,136,18 ), 
106,145,1 }, 
179,121,1 ], 
223,1 ,34 ), 
208,1 ,1 `} 


TH A a c4 oc 


y; 


sub_ref = (sub_mv_ref) treed_read(d, sub_mv_ref tree, 
sub mv ref prob[context]); 


—-e—hlid code block -aes 5 žo 55-55 PI cesta o sodec 
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17. 


Ts 


The first two sub-prediction modes simply copy the already-coded 
motion vectors used by the blocks above and to the left of the 
subblock at the upper left corner of the current subset (i.e., 
collection of subblocks being predicted). These prediction blocks 
need not lie in the current macroblock and, if the current subset 
lies at the top or left edges of the frame, need not lie in the 
frame. In this latter case, their motion vectors are taken to be 
zero, as are subblock motion vectors within an intra-predicted 
macroblock. Also, to ensure the correctness of prediction within 
this macroblock, all subblocks lying in an already-decoded subset of 
the current macroblock must have their motion vectors set. 


ZERO4x4 uses a zero motion vector and predicts the current subset 
using the corresponding subset from the prediction frame. 


NEW4x4 is exactly like NEWMV except that NEW4x4 is applied only to 
the current subset. It is followed by a two-dimensional motion 
vector offset (described in the next section) that is added to the 
best vector returned by the earlier call to find near mvs to form the 
motion vector in effect for the subset. 


Parsing of both inter-prediction modes and motion vectors (described 
next) can be found in the reference decoder file modemv.c 
(Section 20.11). 


Motion Vector Decoding 


As discussed above, motion vectors appear in two places in the VP8 
datastream: applied to whole macroblocks in NEWMV mode and applied to 
subsets of macroblocks in NEW4x4 mode. The format of the vectors is 
identical in both cases. 


Each vector has two pieces: a vertical component (row) followed by a 
horizontal component (column). The row and column use separate 
coding probabilities but are otherwise represented identically. 


1. Coding of Each Component 


Each component is a signed integer V representing a vertical or 
horizontal luma displacement of V quarter-pixels (and a chroma 

displacement of V eighth-pixels). The absolute value of V, if 

non-zero, is followed by a boolean sign. V may take any value 

between -1023 and +1023, inclusive. 
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The absolute value A is coded in one of two different ways according 
to its size. For 0 << A << 7, A is tree-coded, and for 8 << A <= 
1023, the bits in the binary expansion of A are coded using 
independent boolean probabilities. The coding of A begins with a 
bool specifying which range is in effect. 


Decoding a motion vector component then reguires a 19-position 
probability table, whose offsets, along with the procedure used to 
decode components, are as follows: 


—---.Begin code block —-------------------S---o-------------— 


typedef enum 


mvpis short, /* short (<= 7) vs long (>= 8) */ 
MVPsign, / sign for non-zero %/ 

MVPshort, /* 8 short values = 7-position tree */ 
MVPbits = MVPshort + 7, /* 8 long value bits 


w/independent probs */ 


MVPcount = MVPbits + 10 /* 19 probabilities in total */ 
} 
MVPindices; 
typedef Prob MV CONTEXT [MVPcount]; /* Decoding spec for 


a single component */ 


/* Tree used for small absolute values (has expected 


correspondence). */ 

const tree index small mvtree [2 * (8 - 1)] < 

{ 

2, 8, /* "O" subtree, "1" subtree */ 
4, 6, /* "00" subtree, "01" subtree */ 
ED sly /* 0 = "000", 1 = "001" */ 
n. 3 /* 2 < "010", 3 < "011" */ 
10, 12, /* "10" subtree, "11" subtree */ 
-4, -5, /* 4 = "100", 5 = "101" */ 
=6; 7 /* 6 = "110", 7 = "111" */ 


y; 


/* Read MV component at current decoder position, using 
supplied probs. */ 


int read mvcomponent (bool decoder *d, const MV CONTEXT *mvc) 


{ 


const Prob * const p = (const Prob *) mvc; 
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int A < 0; 


if (read_bool(d, p [mvpis_short])) /* 8 << A <= 1023 */ 
( 
/* Read bits 0, 1, 2 */ 


int i = 0; 
do ( A += read bool(d, p [MVPbits + i]) << i;) 
while (++i « 3); 


/* Read bits 9, 8, 7, 6, 5, 4 */ 
qued 
do ( A += read bool(d, p [MVPbits + i]) << i;) 
while (--i > 3); 
/* We know that A >= 8 because it is coded long, 
so if A << 15, bit 3 is one and is not 


explicitly coded. */ 


if (!(A & OxfffO) || read bool(d, p [MVPbits + 3])) 
A += 8; 


else /* 0 << A <= 7 */ 
A < treed read(d, small mvtree, p t MVPshort); 


return A && read bool(r, p [MVPsign]) ? -A : A; 
} 


= oS ENO Code ble ck. ss arta om mmm mm AARÓN 


.2. Probability Updates 


The decoder should maintain an array of two MV CONTEXTs for decoding 
row and column components, respectively. These MV CONTEXTs should be 
set to their defaults every key frame. Each individual probability 
may be updated every interframe (by field J of the frame header) 
using a constant table of update probabilities. Each optional update 
is of the form B? P(7), that is, a bool followed by a 7-bit 
probability specification if true. 


As with other dynamic probabilities used by VP8, the updates remain 
in effect until the next key frame or until replaced via another 
update. 
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In detail, the probabilities should then be managed as follows. 
-==> Begin code block =============== === HSH += 


/* Never-changing table of update probabilities for each 
individual probability used in decoding motion vectors. */ 


const MV CONTEXT vp8 mv update probs[2] < 
{ 
{ 
237, 
246, 
253, 253, 254, 254, 254, 254, 254, 
254, 254, 254, 254, 254, 250, 250, 252, 254, 254 


245, 253, 254, 254, 254, 254, 254, 
254, 254, 254, 254, 254, 251, 251, 254, 254, 254 


/* Default MV decoding probabilities. */ 


const MV_CONTEXT default_mv_context[2] = 
{ 


{ // row 
162, // is short 
128, // sign 
225, 146, 172, 147, 214, 39, 156, // short tree 


128, 129, 132, 75, 145, 178, 206, 239, 254, 254 // long bits 
), 


{ // same for column 
164, // is short 


204, 170, 119, 235, 140, 230, 228, 
128, 130, 130, 74, 148, 180, 203, 236, 254, 254 // long bits 


/* Current MV decoding probabilities, set to above defaults 
every key frame. */ 


MV_CONTEXT mvc [2]; /* always row, then column */ 
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/* Procedure for decoding a complete motion vector. */ 
typedef struct ( intl6 row, col;) MV; /* as in previous section */ 


MV read mv(bool decoder *d) 


{ 


MV v; 
v.row = (int16) read mvcomponent (d, mvc); 
v.col = (int16) read mvcomponent (d, mvc + 1); 


return v; 


} 


/* Procedure for updating MV decoding probabilities, called 
every interframe with "d" at the appropriate position in 
the frame header. */ 


void update_mvcontexts (bool_decoder *d) 


{ 


int i = 0; 
do { /* component = row, then column */ 
const Prob *up = mv update probs[i]; /* update probs 
for component */ 
Prob *p = mvcli]; /* start decode tbl "" */ 
Prob * const pstop = p + MVPcount; /* end decode tbl "" */ 
do { 
if (read bool(d, *up++)) /* update this position */ 
{ 
const Prob x = read_literal(d, 7); 
*p = x? x««1l: 1; 
} 
} while (++p < pstop); /* next position */ 
} while (++i < 2); /* next component */ 


} 


A End code block sss Sass lagers sate Se SSeS See ese Pio > 
This completes the description of the motion-vector decoding 


procedure and, with it, the procedure for decoding interframe 
macroblock prediction records. 
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18.  Interframe Prediction 


Given an inter-prediction specification for the current macroblock, 
that is, a reference frame together with a motion vector for each of 
the sixteen Y subblocks, we describe the calculation of the 
prediction buffer for the macroblock. Frame reconstruction is then 
completed via the previously described processes of residue summation 
(Section 14) and loop filtering (Section 15). 


The management of inter-predicted subblocks and sub-pixel 
interpolation may be found in the reference decoder file predict.c 
(Section 20.14). 


18.1. Bounds on, and Adjustment of, Motion Vectors 


Since each motion vector is differentially encoded from a neighboring 
block or macroblock and the only clamp is to ensure that the 
referenced motion vector represents a valid location inside a 
reference frame buffer, it is technically possible within the VP8 
format for a block or macroblock to have arbitrarily large motion 
vectors, up to the size of the input image plus the extended border 
areas. For practical reasons, VP8 imposes a motion vector size range 
limit of -4096 to 4095 full pixels, regardless of image size (VP8 
defines 14 raw bits for width and height; 16383x16383 is the maximum 
possible image size).  Bitstream-compliant encoders and decoders 
shall enforce this limit. 


Because the motion vectors applied to the chroma subblocks have 
1/8-pixel resolution, the synthetic pixel calculation, outlined in 
Section 5 and detailed below, uses this resolution for the luma 
subblocks as well. In accordance, the stored luma motion vectors are 
all doubled, each component of each luma vector becoming an even 
integer in the range -2046 to +2046, inclusive. 


The vector applied to each chroma subblock is calculated by averaging 
the vectors for the 4 luma subblocks occupying the same visible area 
as the chroma subblock in the usual correspondence; that is, the 
vector for U and V block 0 is the average of the vectors for the Y 
subblocks { 0, 1, 4, 5}, chroma block 1 corresponds to Y blocks { 2, 
3, 6, 7), chroma block 2 to Y blocks { 8, 9, 12, 13), and chroma 
block 3 to Y blocks ( 10, 11, 14, 15). 
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In detail, each of the two components of the vectors for each of the 
chroma subblocks is calculated from the corresponding luma vector 
components as follows: 


==== Begin.code block zero 55 ooo oo oa o-----------— 


int avg(int cl, int c2, int c3, int c4) 
{ 
int s = cl + €2-t.c3 + c4; 


/* The shift divides by 8 (not 4) because chroma pixels 
have twice the diameter of luma pixels. The handling 
of negative motion vector components is slightly 
cumbersome because, strictly speaking, right shifts 
of negative numbers are not well-defined in C. */ 


return s >= 0? (s + 4) >> 3: -((-s + 4) >> 3); 
} 


SoS ENG. "COGS (DOCK, SSS SSS SSS SSS SS Ss SSS SS SSS In poč 


Furthermore, if the version number in the frame tag specifies only 
full-pel chroma motion vectors, then the fractional parts of both 
components of the vector are truncated to zero, as illustrated in the 
following pseudocode (assuming 3 bits of fraction for both luma and 
chroma vectors): 


x = x 
Y c 


ss--uEndgcode DLOCK Far Aness RnR 


Earlier in this document we described the vp8 clamp mv() function to 
limit "nearest" and "near" motion vector predictors inside specified 
margins within the frame boundaries. Additional clamping is 
performed for NEWMV macroblocks, for which the final motion vector is 
clamped again after combining the "best" predictor and the 
differential vector decoded from the stream. 


However, the secondary clamping is not performed for SPLITMV 
macroblocks, meaning that any subblock's motion vector within the 
SPLITMV macroblock may point outside the clamping zone. These 
non-clamped vectors are also used when determining the decoding tree 
context for subsequent subblocks' modes in the vp8 mvCont() function. 
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18. 


.2. Prediction Subblocks 


The prediction calculation for each subblock is then as follows. 
Temporarily disregarding the fractional part of the motion vector 
(that is, rounding "up" or "left" by right-shifting each component 

3 bits with sign propagation) and adding the origin (upper left 
position) of the (16x16 luma or 8x8 chroma) current macroblock gives 
us an origin in the Y, U, or V plane of the predictor frame (either 
the golden frame or previous frame). 


Considering that origin to be the upper left corner of a (luma or 
chroma) macroblock, we need to specify the relative positions of the 
pixels associated to that subblock, that is, any pixels that might be 
involved in the sub-pixel interpolation processes for the subblock. 


3. Sub-Pixel Interpolation 


The sub-pixel interpolation is effected via two one-dimensional 
convolutions. These convolutions may be thought of as operating ona 
two-dimensional array of pixels whose origin is the subblock origin, 
that is the origin of the prediction macroblock described above plus 
the offset to the subblock. Because motion vectors are arbitrary, so 
are these "prediction subblock origins". 


The integer part of the motion vector is subsumed in the origin of 
the prediction subblock; the 16 (synthetic) pixels we need to 
construct are given by 16 offsets from the origin. The integer part 
of each of these offsets is the offset of the corresponding pixel 
from the subblock origin (using the vertical stride). To these 
integer parts is added a constant fractional part, which is simply 
the difference between the actual motion vector and its integer 
truncation used to calculate the origins of the prediction macroblock 
and subblock. Each component of this fractional part is an integer 
between 0 and 7, representing a forward displacement in eighths of a 
pixel. 


It is these fractional displacements that determine the filtering 
process. If they both happen to be zero (that is, we had a "whole 
pixel" motion vector), the prediction subblock is simply copied into 
the corresponding piece of the current macroblock’s prediction 
buffer. As discussed in Section 14, the layout of the macroblock’s 
prediction buffer can depend on the specifics of the reconstruction 
implementation chosen. Of course, the vertical displacement between 
lines of the prediction subblock is given by the stride, as are all 
vertical displacements used here. 
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Otherwise, at least one of the fractional displacements is non-zero. 
We then synthesize the missing pixels via a horizontal, followed by a 
vertical, one-dimensional interpolation. 


The two interpolations are essentially identical. Each uses a (at 
most) six-tap filter (the choice of which of course depends on the 
one-dimensional offset). Thus, every calculated pixel references at 
most three pixels before (above or to the left of) it and at most 
three pixels after (below or to the right of) it. The horizontal 
interpolation must calculate two extra rows above and three extra 
rows below the 4x4 block, to provide enough samples for the vertical 
interpolation to proceed. 


Depending on the reconstruction filter type given in the version 
number field in the frame tag, either a bicubic or a bilinear tap set 
is used. 

The exact implementation of subsampling is as follows. 


seže Begins. codeblock === 377335 SSS Se Se aS se SSS SSS == 


/* Filter taps taken to 7-bit precision. 
Because DC is always passed, taps always sum to 128. */ 


const int BilinearFilters[8][6] = 


{ 


{ 0, 0, 128, 0, 0, 0 ), 
(05: (Oz. T2; 16, 0, 0 ), 
(0,0, 96, 32, 0, 0 ), 
(0,0, 80, 48, 0, 0 ), 
(0,0, 64, 64, 0, 0 ), 
(0,0, 48, 80, 0, 0 ), 
(0,0, 32, 96,0, 0 ), 
{ 0, 0, 16, 112, :0,- 0j 
y; 
const int filters [8] [6] = ( /* indexed by displacement */ 
(0, 0, 128, 0, 0, 0), /* degenerate whole-pixel */ 
{ 0, -6, 123, 12; =l O ), /* 1/8 */ 
{ 2, Db X108; 36, -8, JT. uer /* 1/4 */ 
Db) 99 935 50, -6, On}; /* 3/8 */ 
{ 3, -16, 77, 77, -16, 3 ), /* 1/2 is symmetric */ 
{ 0, -6, 50, 93, -9, 0), /* 5/8 = reverse of 3/8 */ 
{ 1, -8, 36, 108, -11, 2 }, /* 3/4 = reverse of 1/4 */ 
1-07, =1, 12, 123, -6, 0) /* 7/8 = reverse of 1/8 */ 


y; 
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/* One-dimensional synthesis of a single sample. 
Filter is determined by fractional displacement */ 


Pixel interp( 
const int fil[6], /* filter to apply */ 


const Pixel *p, /* origin (rounded "before") in 
prediction area */ 
const int s /* size of one forward step "" */ 
) 4 
int32 a < 0; 
int i < 0; 
p -= s + s; /* move back two positions %/ 
do { 
a t= *p * fil[i]; 
p += s; 


} while (++i < 6); 


return clamp255((a + 64) >> 7); /* round to nearest 
8-bit value */ 


/* First do horizontal interpolation, producing intermediate 
buffer. */ 


void Hinterp ( 


Pixel temp[9] [4], /* 9 rows of 4 (intermediate) 
destination values */ 

const Pixel *p, /* subblock origin in prediction 
frame */ 

int s, /* vertical stride to be used in 
prediction frame */ 

uint hfrac, /* 0 <= horizontal displacement <= 7 */ 

uint bicubic /* 1-bicubic filter, O=bilinear */ 

) 1 
const int * const fil = bicubic ? filters [hfrac] 


BilinearFilters[hfrac]; 


int r = 0; do /* for each row */ 
( 
int c < 0; do /* for each destination sample */ 
( 
/* Pixel separation - one horizontal step - 1 */ 


temp[r][c] = interp(fil, p + c, 1); 
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while (++c < 4); 
} 
while (p += s, ++r < 9); /* advance p to next row */ 


} 


/* Finish with vertical interpolation, producing final results. 
Input array "temp" is of course that computed above. */ 


void Vinterp( 
Pixel final[4][4], /* 4 rows of 4 (final) destination values */ 
const Pixel temp[9] [4], 
uint vfrac, /* 0 << vertical displacement << 7 */ 
uint bicubic /* 1-bicubic filter, O=bilinear */ 


const int * const fil < bicubic ? filters [vfrac] 
BilinearFilters[vfrac]; 


int r = 0; do /* for each row %/ 
{ 
ine e m0; do /* for each destination sample */ 
{ 
/* Pixel separation = one vertical step = width 
of array = 4 */ 


final[r][c] = interp(fil, temp[r] + c, 4); 
} 
while (++c « 4); 


} 
while (++r < 4); 


} 


roo ENA COS LOCK oo TESS A oa ES 


4. Filter Properties 

We discuss briefly the rationale behind the choice of filters. Our 
approach is necessarily cursory; a genuinely accurate discussion 
would require a couple of books. Readers unfamiliar with signal 


processing may or may not wish to skip this. 


All digital signals are of course sampled in some fashion. The case 
where the inter-sample spacing (say in time for audio samples, or 
space for pixels) is uniform, that is, the same at all positions, is 
particularly common and amenable to analysis. Many aspects of the 
treatment of such signals are best-understood in the frequency domain 
via Fourier Analysis, particularly those aspects of the signal that 
are not changed by shifts in position, especially when those 
positional shifts are not given by a whole number of samples. 
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Non-integral translates of a sampled signal are a textbook example of 
the foregoing. In our case of non-integral motion vectors, we wish 
to say what the underlying image "really is" at these pixels; 
although we don't have values for them, we feel that it makes sense 
to talk about them. The correctness of this feeling is predicated on 
the underlying signal being band-limited, that is, not containing any 
energy in spatial frequencies that cannot be faithfully rendered at 
the pixel resolution at our disposal. In one dimension, this range 
of "OK" frequencies is called the Nyquist band; in our two- 
dimensional case of integer-grid samples, this range might be termed 
a Nyquist rectangle. The finer the grid, the more we know about the 
image, and the wider the Nyquist rectangle. 


It turns out that, for such band-limited signals, there is indeed an 
exact mathematical formula to produce the correct sample value at an 
arbitrary point. Unfortunately, this calculation requires the 
consideration of every single sample in the image, as well as needing 
to operate at infinite precision. Also, strictly speaking, all band- 
limited signals have infinite spatial (or temporal) extent, so 
everything we are discussing is really some sort of approximation. 


It is true that the theoretically correct subsampling procedure, as 
well as any approximation thereof, is always given by a translation- 
invariant weighted sum (or filter) similar to that used by VP8. It 
is also true that the reconstruction error made by such a filter can 
be simply represented as a multiplier in the frequency domain; that 
is, such filters simply multiply the Fourier transform of any signal 
to which they are applied by a fixed function associated to the 
filter. This fixed function is usually called the frequency response 
(or transfer function); the ideal subsampling filter has a frequency 
response equal to one in the Nyquist rectangle and zero everywhere 
else. 


Another basic fact about approximations to "truly correct" 
subsampling is that the wider the subrectangle (within the Nyquist 
rectangle) of spatial frequencies one wishes to "pass" (that is, 
correctly render) or, put more accurately, the closer one wishes to 
approximate the ideal transfer function, the more samples of the 
original signal must be considered by the subsampling, and the wider 
the calculation precision necessitated. 


The filters chosen by VP8 were chosen, within the constraints of 4 or 
6 taps and 7-bit precision, to do the best possible job of handling 
the low spatial frequencies near the 0th DC frequency along with 
introducing no resonances (places where the absolute value of the 
frequency response exceeds one). 
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The justification for the foregoing has two parts. First, resonances 
can produce extremely objectionable visible artifacts when, as often 
happens in actual compressed video streams, filters are applied 
repeatedly. Second, the vast majority of energy in real-world images 
lies near DC and not at the high end. 


To get slightly more specific, the filters chosen by VP8 are the best 
resonance-free 4- or 6-tap filters possible, where "best" describes 
the frequency response near the origin: The response at 0 is required 
to be 1, and the graph of the response at 0 is as flat as possible. 


To provide an intuitively more obvious point of reference, the "best" 
2-tap filter is given by simple linear interpolation between the 
surrounding actual pixels. 


Finally, it should be noted that, because of the way motion vectors 
are calculated, the (shorter) 4-tap filters (used for odd fractional 
displacements) are applied in the chroma plane only. Human color 
perception is notoriously poor, especially where higher spatial 
frequencies are involved. The shorter filters are easier to 
understand mathematically, and the difference between them and a 
theoretically slightly better 6-tap filter is negligible where chroma 
is concerned. 


Annex A: Bitstream Syntax 


This annex presents the bitstream syntax in a tabular form. All the 
information elements have been introduced and explained in the 
previous sections but are collected here for a quick reference. Each 
syntax element is briefly described after the tabular representation 
along with a reference to the corresponding paragraph in the main 
document. The meaning of each syntax element value is not repeated 
here. 


The top-level hierarchy of the bitstream is introduced in Section 4. 


Definition of syntax element coding types can be found in Section 8. 
The types used in the representation in this annex are: 


o f(n), n-bit value from stream (n successive bits, not boolean 
encoded) 


o L(n), n-bit number encoded as n booleans (with equal probability 
of being 0 or 1) 


o B(p), bool with probability p of being 0 


o T, tree-encoded value 
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1. Uncompressed Data Chunk 


| frame tag 

| if (key frame) ( 

| start_code 

| horizontal_size_code 
| vertical size code 


} 


The 3-byte frame tag can be parsed as follows: 


November 2011 


-s-c- Begin code block: ====>5===5222==2 2422254252225 


unsigned char *c = pbi->source; 
unsigned int tmp; 


tmp = (c[2] << 16) | (ct1] << 8) | crol; 


key frame = tmp & 0x1; 

version - (tmp >> 1) & 0x7; 

show frame = (tmp >> 4) & 0x1; 

first part size = (tmp >> 5) € OxT7FFFF; 


Ses) Bnd code block —-55--5 Eo o aS o > o EI mo MA lop oder 


Where: 


o key frame indicates whether the current frame is a key frame 


or not. 


o version determines the bitstream version. 


o show frame indicates whether the current frame is meant to be 


displayed or not. 


o first part size determines the size of the first partition 
(control partition), excluding the uncompressed data chunk. 
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The start code is a constant 3-byte pattern having value 0x9d012a. 
The latter part of the uncompressed chunk (after the start code) can 
be parsed as follows: 


==== Begin code block. -2---22-2---5----2--—-—---ccomoceeceo cc 


unsigned char *c = pbi->source + 6; 
unsigned int tmp; 


tmp = (c[1] << 8) | c[0]; 


width = tmp € Ox3FFF; 
horizontal scale = tmp >> 14; 


tmp = (c[3] << 8) | e621; 


height = tmp & Ox3FFF; 
vertical_scale = tmp >> 14; 


zero hd Code block a oor 
19.2. Frame Header 
| Frame Header | Type 


if (key_frame) { 


i 
| | | 
| color_space | L(1) | 
| clamping type | L(1) | 

} 
| Segmentation enabled | L(1) | 
| if (segmentation_enabled) | 
| update segmentation() | 
| filter type | L(1) | 
| loop filter level L(6) 
sharpness level L(3) 
| mb 1f adjustments () | | 
| log2 nbr of dct partitions [Mr | 
| quant indices () | | 
| if (key_frame) | | 
| refresh_entropy_probs pia) | 
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| else ( | 
| refresh_golden_frame 

refresh_alternate_frame 
| if (!refresh golden frame) 
| copy_buffer_to_golden 
| if (!refresh_alternate_frame) | 
| copy_buffer_to_alternate 
| sign_bias_golden 
| sign bias alternate 

refresh entropy probs 
| refresh last | 
} 
| token_prob_update () 
| mb_no_skip_coeff 
| if (mb_no_skip_coeff) 

prob_skip_false | L (8) 
| if (!key_frame) { 
| prob_intra | 
| prob_last | 
| prob_gf | 
| intra_16x16_prob_update_flag 

if (intra 16x16 prob update flag) { 
| for (i = 0; i < 4; i++) | 
| intra_16x16_prob | 
| | 
| | 
| | 
| | 


} 
intra_chroma prob_update_flag 
if (intra_chroma_prob_update_flag) { 
for (i = 0; i < 3; i++) 
intra_chroma_prob 
} 
mv prob update () 


O color space defines the YUV color space of the sequence 
(Section 9.2) 


o clamping type specifies if the decoder is required to clamp the 
reconstructed pixel values (Section 9.2) 


o segmentation enabled enables the segmentation feature for the 
current frame (Section 9.3) 


o filter type determines whether the normal or the simple loop 
filter is used (Sections 9.4, 15) 


o loop filter level controls the deblocking filter 
(Sections 9.4, 15) 
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o sharpness level controls the deblocking filter (Sections 9.4, 15) 


o  log2 nbr of dct partitions determines the number of separate 
partitions containing the DCT coefficients of the macroblocks 
(Section 9.5) 


o refresh entropy probs determines whether updated token 
probabilities are used only for this frame or until further update 


o refresh golden frame determines if the current decoded frame 
refreshes the golden frame (Section 9.7) 


o refresh alternate frame determines if the current decoded frame 
refreshes the alternate reference frame (Section 9.7) 


o copy buffer to golden determines if the golden reference is 
replaced by another reference (Section 9.7) 


o copy buffer to alternate determines if the alternate reference is 
replaced by another reference (Section 9.7) 


o sign bias golden controls the sign of motion vectors when the 
golden frame is referenced (Section 9.7) 


o sign bias alternate controls the sign of motion vectors when the 
alternate frame is referenced (Section 9.7) 


o refresh last determines if the current decoded frame refreshes the 
last frame reference buffer (Section 9.8) 


o mb no skip coeff enables or disables the skipping of macroblocks 
containing no non-zero coefficients (Section 9.10) 


o prob skip false indicates the probability that the macroblock is 
not skipped (flag indicating skipped macroblock is false) 
(Section 9.10) 


o prob intra indicates the probability of an intra macroblock 
(Section 9.10) 


o prob last indicates the probability that the last reference frame 
is used for inter-prediction (Section 9.10) 


o prob gf indicates the probability that the golden reference frame 
is used for inter-prediction (Section 9.10) 
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o intra 16x16 prob update flag indicates if the branch probabilities 
used in the decoding of the luma intra-prediction mode are updated 
(Section 9.10) 


o intra 16x16 prob indicates the branch probabilities of the luma 
intra-prediction mode decoding tree 


o intra chroma prob update flag indicates if the branch 
probabilities used in the decoding of the chroma intra-prediction 
mode are updated (Section 9.10) 


o intra chroma prob indicates the branch probabilities of the chroma 
intra-prediction mode decoding tree 


| update segmentation () | Type | 
| update mb segmentation map L(1) | 
| update segment feature data | ia) | 
| if (update_segment_feature_data) { | 

| segment_feature_mode | L(1) | 
| for (i = 0; i < 4; i++) 4 | | 
| quantizer update | L(1) | 

if (guantizer update) { 
| quantizer_update_value | L) | 
| quantizer update sign | L(1) | 
} 
| | | 
for (i = 0; i < 4; i++) 4 

| loop_filter_update | L(1) | 
| if (loop filter update) { | 

| lf_update_value | L(6) | 
| lf_update_sign Lab ll 
| } | | 

} 

E || 
| if (update mb segmentation map) { | 

| for (i < 0; i < 3; i++) 4 | | 
| segment_prob_update | L(1) | 
| if (segment prob update) | | 
| segment prob | L(8) | 
| | | 


o update mb segmentation map determines if the MB segmentation map 
is updated in the current frame (Section 9.3) 


o update segment feature data indicates if the segment feature data 
is updated in the current frame (Section 9.3) 
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o segment feature mode indicates the feature data update mode, 0 for 
delta and 1 for the absolute value (Section 9.3) 


o quantizer update indicates if the quantizer value is updated for 
the i^(th) segment (Section 9.3) 


o quantizer update value indicates the update value for the segment 
quantizer (Section 9.3) 


o quantizer update sign indicates the update sign for the segment 
guantizer (Section 9.3) 


o loop filter update indicates if the loop filter level value is 
updated for the i^(th) segment (Section 9.3) 


o lf update value indicates the update value for the loop filter 
level (Section 9.3) 


o lf update sign indicates the update sign for the loop filter level 
(Section 9.3) 


o segment prob update indicates whether the branch probabilities 
used to decode the segment id in the MB header are decoded from 
the stream or use the default value of 255 (Section 9.3) 


o segment prob indicates the branch probabilities of the segment id 
decoding tree (Section 9.3) 
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} 


| mb lf adjustments () | Type | 
po eee po 
| loop filter. adj enable | L(1) | 
if (loop filter adj enable) { 
| mode ref lf delta update | ia) | 
| if (mode_ref_lf_delta_update) { | | 
| for (i = 0; i < 4; i++) { | | 
| ref_frame_delta_update_flag ra | 
| if (ref frame delta update flag) { 
delta magnitude L(6) 
| delta_sign | L(1) | 
| } | | 
| } | | 
| for (i = 0; i < 4; i++) { | | 
| mb_mode_delta_update_flag | Lia) | 
| if (mb mode delta update flag) 1 | 
delta magnitude L(6) 

| delta_sign | Lia) | 
| || 
| | | 
| | | 


o loop filter adj enable indicates if the MB-level loop filter 
adjustment (based on the used reference frame and coding mode) is 
on for the current frame (Section 9.4) 


o mode ref 1f delta update indicates if the delta values used in an 
adjustment are updated in the current frame (Section 9.4) 


o ref frame delta update flag indicates if the adjustment delta 
value corresponding to a certain used reference frame is updated 
(Section 9.4) 


o delta magnitude is the absolute value of the delta value 


O delta sign is the sign of the delta value 


o mb mode delta update flag indicates if the adjustment delta value 
corresponding to a certain MB prediction mode is updated 
(Section 9.4) 
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| quant indices() | Type | 
E e utu AU eue pesi 
| y ac gi | L(7) | 

y dc delta present L(1) 
| if (y dc delta present) ( | 
| | v dc delta magnitude | L(4) | 
| y. dc, delta sign | ia) | 

} | | 
| y2_dc_delta_present L(1) 

if (y2_dc_delta_present) { 
| y2. dc, delta, magnitude | L(4) | 
| y2 de. delta sign | L(1) | 

} | | 
| y2_ac_delta_present | L(1) | 
| if (y2 ac delta present) { | 
| y2. ac. delta, magnitude | L(4) | 
| y2. ac delta sign L(1) 

} | | 
| uv dc delta present | ia) | 
| if (uv_dc_delta_present) { | | 
| | uv dc delta magnitude | L(4) | 
| uv dc. delta sign | Ll) | 

} 
| uv_ac_delta_present | L(1) | 
| if (uv ac delta present) { | | 
| uv_ac_delta_magnitude | L(4) | 
| uv ac delta sign | L(1) | 

| | 


} 


o vy. ac gi is the deguantization table index used for the luma AC 
coefficients (and other coefficient groups if no delta value is 
present) (Section 9.6) 


o y dc delta present indicates if the stream contains a delta value 
that is added to the baseline index to obtain the luma DC 


coefficient deguantization index (Section 9.6) 


o y dc delta magnitude is the magnitude of the delta value 
(Section 9.6) 


o y dc delta sign is the sign of the delta value (Section 9.6) 
o y2 dc delta present indicates if the stream contains a delta value 


that is added to the baseline index to obtain the Y2 block DC 
coefficient deguantization index (Section 9.6) 


Bankoski, et al. Informational [Page 128] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


o y2 ac delta present indicates if the stream contains a delta value 
that is added to the baseline index to obtain the Y2 block AC 
coefficient dequantization index (Section 9.6) 


o uv dc delta present indicates if the stream contains a delta value 
that is added to the baseline index to obtain the chroma DC 
coefficient dequantization index (Section 9.6) 


o uv ac delta present indicates if the stream contains a delta value 
that is added to the baseline index to obtain the chroma AC 
coefficient dequantization index (Section 9.6) 


| token prob update () | Type | 
| for (i = 0; i < 4; i++) 4 | | 
for (j = 0; j < 8; J++) 4 
for (k = 0; k < 3; k++) { 

| for (1 = 0; 1 < 11; 1++) { | | 
| coeff_prob_update_flag |ia) 17] 
| if (coeff_prob_update_flag) | 

| coeff_prob | L(8) | 
| | | 
| | | 


o coeff prob update flag indicates if the corresponding branch 
probability is updated in the current frame (Section 13.4) 


o coeff prob is the new branch probability (Section 13.4) 
| mv prob update () | Type | 


for (i = 0; i < 2; i++) 4 


for (j 0; j < 19; j**) ( 
if (mv prob update flag) 
prob L(7) 


| 
mv_prob_update_flag | L(1) 

| 

| 

| 


o mv prob update flag indicates if the corresponding MV decoding 
probability is updated in the current frame (Section 17.2) 


o prob is the updated probability (Section 17.2) 
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19.3.  Macroblock Data 


macroblock_header () | | 
residual data() | | 


| macroblock header() | Type | 


if (update mb segmentation map) 


| | | 

| segment_id | T | 

| if (mb_no_skip_coeff) | 

| mb_skip_coeff | B(p) | 

| if (!key frame) | | 
is_inter_mb | B (p) | 

| if (is inter mb) 1 

| mb ref frame _ sell | B (p) | 

| if (mb ref frame sell) | 

| mb ref frame sel2 | B (p) | 

| mv_mode | T | 
if (mv mode == SPLITMV) { | | 

| mv split mode E 

| for (i = 0; i < numMvs; i++) { | 

| sub_mv_mode | T | 

| if (sub_mv_mode == NEWMV4x4) { | 

| read_mvcomponent () | | 

| read mvcomponent () | | 

} 

| } | | 

| } else if (mv_mode == NEWMV) { | 

| read_mvcomponent () | | 

| read_mvcomponent () | | 
} 

| ) else ( /* intra mb */ | 

| | intra y mode | T | 

| if (intra y mode == B PRED) { | 

| for (i = 0; i < 16; i++) | | 

| intra_b_mode |T | 
} 

| intra uv mode T 

| 


o segment id indicates to which segment the macroblock belongs 
(Section 10) 


o mb skip coeff indicates whether the macroblock contains any coded 
coefficients or not (Section 11.1) 
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is inter mb indicates whether the macroblock is intra- or inter- 
coded (Section 16) 

mb ref frame sell selects the reference frame to be used; last 
frame (0), golden/alternate (1) (Section 16.2) 

mb ref frame sel2 selects whether the golden (0) or alternate 
reference frame (1) is used (Section 16.2) 

mv mode determines the macroblock motion vector mode 

(Section 16.2) 

mv split mode gives the macroblock partitioning specification and 
determines the number of motion vectors used (numMvs) 

(Section 16.2) 

sub mv mode determines the sub-macroblock motion vector mode for 
macroblocks coded using the SPLITMV motion vector mode 

(Section 16.2) 

intra y mode selects the luminance intra-prediction mode 
(Section 16.1) 

intra b mode selects the sub-macroblock luminance prediction mode 
for macroblocks coded using B PRED mode (Section 16.1) 
intra uv mode selects the chrominance intra-prediction mode 
(Section 16.1) 
residual data() | Type | 


if (!mb skip coeff) ( | 
if ( (is_inter_mb && mv_mode != SPLITMV) | | | 
(!is_inter_mb && intra_y_mode != B_PRED) ) | 
residual block() /% Y2 */ 
for (i = 0; i < 24; i++) | 
residual block() /* 16 Y, 4 U, 4 V */ | 
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| residual block () | Type | 


for (i = firstCoeff; i < 16; i++) 4 


| token | T 

| if (token == EOB) break; | 

| if (token_has_extra_bits) | 

| extra_bits | L(n) | 
| if (coefficient != 0) | | 
| sign | L(1) | 


o firstCoeff is 1 for luma blocks of macroblocks containing Y2 
subblock; otherwise 0 


o token defines the value of the coefficient, the value range of the 
coefficient, or the end of block (Section 13.2) 


o extra bits determines the value of the coefficient within the 
value range defined by the token (Section 13.2) 


o Sign indicates the sign of the coefficient (Section 13.2) 
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20. 


20. 


20 


Attachment One: Reference Decoder Source Code 


1. bit ops.h 


==== Begin code block ================+==2=53==3=3========== 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

x 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

8/ 


#ifndef BIT OPS H 
#define BIT OPS H 


/* Evaluates to a mask with n bits set %/ 
#define BITS MASK(n) ((1««(n))-1) 


/* Returns len bits, with the LSB at position bit */ 
#define BITS GET(val, bit, len) (((val)>>(bit)) sBITS MASK(len)) 


Hendif 


= 2 End code block '+======+==5 HSH 92229 9 Ho 282 


.2. | bool decoder.h 


---Begrinocode block =======23====5=====32==22 25922 SH Ses 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
x tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 

#ifndef BOOL DECODER H 
#define BOOL DECODER H 
#include <stddef.h> 
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struct bool decoder 


( 


const unsigned char "input; 


size t 


input len; 


next compressed data byte */ 
length of the input buffer */ 


unsigned int range; /* identical to encoder's 
* range */ 
unsigned int value; /* contains at least 8 


int 


bit count; 


significant bits */ 
# of bits shifted out of 
value, max 7 */ 


y; 


static void 
init_bool_decoder (struct bool decoder *d, 
const unsigned char *start_partition, 


size_t sz) 
{ 
if (sz >= 2) 
{ 
d->value = (start_partition[0] << 8) /* first 2 input 
* bytes */ 
| start partition[1]; 
d->input = start partition + 2; /* ptr to next byte */ 
d->input len < sz - 2; 
} 
else 
{ 
d->value = 0; 
d->input = NULL; 
d->input_len = 0; 
} 
d->range = 255; /* initial range is full */ 


d->bit_count = 0; /* have not yet shifted out any bits */ 


} 


static int bool_get(struct bool_decoder *d, 


{ 


int probability) 


/* range and split are identical to the corresponding values 
used by the encoder when this bool was written */ 


unsigned int split = 1 + (((d->range - 1) * probability) >> 8); 
unsigned int SPLIT < split << 8; 
int retval; /* will be 0 or 1 %/ 
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if (d->value »- SPLIT) /* encoded a one */ 
{ 
retval = 1; 
d->range -= split; /* reduce range */ 
d->value -= SPLIT; /* subtract off left endpoint of 
* interval */ 
} 
else /* encoded a zero */ 
{ 
retval = 0; 
d->range = split; /* reduce range, no change in left 


* endpoint */ 


while (d->range < 128) /* shift out irrelevant value bits */ 


{ 
d->value <<= 1; 
d->range <<= 1; 


if (++d->bit_count == 8) /* shift in new bits 8 at 


{ 
d->bit_count = 0; 


if (d->input_len) 

{ 
d->value |= *d->input++; 
d-»input len--; 


} 


return retval; 


} 


static int bool_get_bit (struct bool_decoder *br) 
{ 

return bool get(br, 128); 
} 
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static int bool get uint(struct bool decoder *br, int bits) 
{ 

int z < 0; 

int bit; 


for (bit < bits - 1; bit >< 0; bit--) 
{ 

z |= (bool_get_bit (br) << bit); 
} 


return z; 


} 


static int bool_get_int (struct bool_decoder *br, int bits) 
{ 

int z < 0; 

int bit; 


for (bit < bits - 1; bit >< 0; bit--) 
{ 

z |= (bool_get_bit (br) << bit); 
} 


return bool get bit(br) ? -z : z; 


} 


static int bool_maybe_get_int (struct bool_decoder *br, int bits) 


{ 
return bool get bit(br) ? bool get int(br, bits) : 0; 
} 


static int 
bool_read_tree(struct bool_decoder *bool, 
const int At, 
const unsigned char "p) 
int i < 0; 
while ((i < t[ i * bool get(bool, pli>>1])]) > 0); 
return -i; 


} 
#endif 


=2== ¿End Code: block, ===5==5====35==35=S==2 ss eS Sse SS ss 
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20.3.  deguant data.h 


smmm Begin code block === === 20232125 32 


/* 

* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 


static const int dc q lookup[128] - 

{ 
4, 55 6, 7, 8, 9, 10, 10, 
11, 12, 13, 14, 15, 16, 17, 17, 
18, 19, 20, 20, 21, 21, 22; 22, 
23, 23, 24, 25, 25, 26, 27, 28, 
29, 30, 31; 32, 33, 34, 35; 36, 
37, 37, 38, 39, 40, 41, 42, 43, 
44, 45, 46, 46, 47, 48, 49, 50, 
bi, 52, 53; 54, 555 56, 517; 58, 
59, 60, 61, 62, 63, 64, 65, 66, 
67, 68, 69, 70, 71; "27. 73, 74, 
75, 76, 76, TT; 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 
91, 93, 957 96, 98, 100, 101, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 
122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 143, 145, 148, 151, 154, 157 

y; 


static const int ac q lookup[128] 

{ 
4, 5; 6, T7 8, 9, 10, Dt. 
12, 13, 14, 15, 16, 17 18, 19, 
20, 21, 22, 23, 24, 25, 26, 27, 
28, 29, 30, JI; 32, 33, 34, 35; 
36, 37, 38, 39, 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, 
52, 53, 54, 55, 56, 57, 58, 60, 
62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 
94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 119, 122, 125, 128, 
131, 134, 137, 140, 143, 146, 149, 152, 
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20. 


155, 158, 161, 164, 167, 170, 173, 177, 
181, 185, 189, 193, 197, 201, 205, 209, 
213, 217, 221, 225, 229, 234, 239, 245, 
249, 254, 259, 264, 269, 274, 279, 284 


n 
sete: End code: (block, =ssSsS+SSssssSs S55 Sse SSS SSH SS SSS seseae 


4.  dixie.c 


2es— Begin code block. ==-===2=====5=====3===9952522925=252==2=== 


7 
o obo oo ob ob ob OE a 


Y 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


finclude "vpx codec internal.h" 
finclude "bit ops.h" 

finclude "dixie.h" 

finclude "vp8 prob data.h" 
finclude "deguant data.h" 
finclude "modemv.h" 

finclude "tokens.h" 

finclude "predict.h" 

finclude "dixie loopfilter.h" 
finclude <string.h> 

finclude <assert.h> 


enum 
FRAME HEADER SZ < 3, 


KEYFRAME HEADER SZ 7 


y; 


#define ARRAY COPY (a,b) {\ 


assert (sizeof (a) ==sizeof (b));memcpy (a,b, sizeof (a));) 
static void 
decode_entropy_header (struct vp8_decoder_ctx ROCKI 
struct bool_decoder *bool, 
struct vp8_entropy_hdr *hdr) 
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int i, j, k, l; 


/* Read coefficient probability updates */ 
for (i = 0; i < BLOCK TYPES; i++) 
for (j < 0; j < COEFF BANDS; jtt) 
for (k = 0; k < PREV COEFF CONTEXTS; k++) 
for (l = 0; 1 < ENTROPY NODES; 114) 
if (bool get (bool, 
k coeff entropy update probs 
[i] [j] [k] [11)) 
hdr->coeff_probs[i][j][k][1] = 
bool_get_uint (bool, 8); 


/* Read coefficient skip mode probability */ 
hdr->coeff skip enabled = bool get bit (bool); 


if (hdr->coeff skip enabled) 
hdr->coeff skip prob = bool get uint(bool, 8); 


/* Parse interframe probability updates */ 
if (!ctx->frame hdr.is keyframe) 


{ 


hdr->prob_inter bool_get_uint (bool, 8); 
hdr->prob_last bool_get_uint (bool, 8); 
hdr->prob_gf = bool_get_uint (bool, 8); 


if (bool get bit (bool)) 
for (i = 0; i < 4; i++) 
hdr->y mode probs[i] = bool_get_uint (bool, 8); 


if (bool_get_bit (bool) ) 
for (i = 0; i < 3; itt) 
hdr->uv mode probs[i] = bool get uint(bool, 8); 


for (G 5:07 x € 2; a+) 
for (j = 0; j « MV PROB CNT; j++) 
if (bool get(bool, k mv entropy update probs[il[jl)) 
{ 
int x = bool_get_uint (bool, 7); 
hdr->mv probs[il[j] = x ? x << 1: 1; 
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static void 

decode reference header(struct vp8 decoder ctx Fet; 
struct bool_decoder *bool, 
struct vp8 reference hdr  *hdr) 


unsigned int key = ctx->frame hdr.is keyframe; 


hdr->refresh gf = key ? 1 : bool get bit (bool); 
hdr->refresh arf = key ? 1 : bool get bit (bool); 
hdr->copy gf = key ? 0 : !hdr-»refresh gf 

? bool get uint(bool, 2) : 0; 
hdr-»copy arf = key ? 0 : !hdr-»refresh arf 

? bool get uint(bool, 2) : 0; 
hdr->sign bias[GOLDEN FRAME] < key ? 0 : bool get bit (bool); 
hdr->sign bias[ALTREF FRAME] = key ? 0 : bool get bit (bool); 
hdr->refresh entropy = bool get bit (bool); 
hdr->refresh last = key ? 1 : bool get bit (bool); 


static void 


decode quantizer header(struct vp8 decoder ctx *ctx, 
struct bool decoder *bool, 
struct vp8_quant_hdr *hdr) 


int update; 
int last q = hdr->g index; 


hdr->g index = bool get uint(bool, 7); 
update = last q != hdr->g index; 


update | = (hdr->yl dc delta g = bool maybe get int(bool, 4)); 
update |= (hdr->y2 dc delta g = bool maybe get int(bool, 4)); 
update |= (hdr->y2 ac delta g = bool maybe get int(bool, 4)); 
update |= (hdr->uv dc delta g = bool maybe get int(bool, 4)); 
update |= (hdr->uv ac delta g = bool maybe get int(bool, 4)); 
hdr->delta update - update; 
} 
static void 
decode and init token partitions(struct vp8 decoder ctx *etsx, 
struct bool decoder *bool, 
const unsigned char *data, 
unsigned int SZ, 
struct vp8_token_hdr *hdr) 
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int i; 
hdr->partitions = 1 << bool get uint (bool, 2); 
if (sz < 3 *(hdr-»partitions - 1)) 


vpx internal error(&ctx-»error, VPX CODEC CORRUPT FRAME, 
"Truncated packet found parsing partition" 


" lengths."); 
Sz -= 3 * (hdr->partitions - 1); 
for (i = 0; i < hdr->partitions; 1++) 


if (i < hdr->partitions - 1) 
{ 


(data[2] << 16) 
| (data[1] << 8) | data[0]; 


hdr->partition szli] 


data += 3; 
) 
else 
hdr->partition sz[i] = sz; 


if (sz « hdr->partition sz[il) 
vpx internal error(&ctx-»error, VPX CODEC CORRUPT FRAME, 
"Truncated partition $d", i); 


Sz -= hdr->partition sz[il; 


for (i = 0; i < ctx->token hdr.partitions; i++) 


init bool decoder (sctx->tokens[i].bool, data, 
ctx->token hdr.partition sz[il); 
data += ctx->token hdr.partition sz[il; 


static void 

decode loopfilter header(struct vp8 decoder ctx *ctx, 
struct bool decoder *bool, 
struct vp8_loopfilter_hdr *hdr) 
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if (ctx->frame hdr.is keyframe) 
memset (hdr, 0, sizeof("hdr)); 


hdr->use simple = bool get bit (bool); 
hdr->level bool get uint(bool, 6); 
hdr->sharpness bool get uint(bool, 3); 
hdr->delta enabled = bool get bit (bool); 


if (hdr->delta enabled && bool get bit (bool)) 
{ 


int i; 


for (i = 0; i < BLOCK_CONTEXTS; i++) 
hdr->ref delta[i] = bool maybe get int(bool, 6); 


for (i = 0; i < BLOCK CONTEXTS; i++) 
hdr->mode delta[i] < bool maybe get int(bool, 6); 


static void 

decode segmentation header(struct vp8 decoder ctx *ctx, 
struct bool decoder *bool, 
struct vp8_segment_hdr *hdr) 


if (ctx->frame hdr.is keyframe) 
memset (hdr, 0, sizeof("hdr)); 


hdr->enabled = bool get bit (bool); 
if (hdr->enabled) 
( 

int i; 


hdr->update map = bool get bit (bool); 
hdr->update data = bool get bit (bool); 
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if (hdr->update data) 
{ 
hdr->abs = bool_get_bit (bool); 


for (i = 0; i < MAX MB SEGMENTS; i++) 
hdr->guant idx[i] = bool maybe get int(bool, 7); 


for (i = 0; i < MAX MB SEGMENTS; i++) 
hdr->1f level[i] < bool maybe get int(bool, 6); 
} 


if (hdr->update map) 
{ 
for (i = 0; i < MB FEATURE TREE PROBS; i++) 
hdr->tree_probs[i] = bool get bit (bool) 
? bool get uint(bool, 8) 
255% 


} 


else 


{ 
hdr->update map = 0; 
hdr->update data = 0; 


static void 
dequant global init(struct dequant factors dqf[MAX MB SEGMENTS] ) 
{ 


int 1; 


for (i = 0; i < MAX MB SEGMENTS; i++) 
dqf[i].quant_idx = -1; 


static int 
clamp_q(int q) 
{ 
if (q < 0) return 0; 
else if (q > 127) return 127; 


return q; 
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static int 
dc q(int q) 
{ 
return dc_q_lookup[clamp_q(q) 1]; 
} 


static int 
ac_q(int q) 
{ 
return ac_q_lookup[clamp_q(q)]; 
} 


static void 

dequant_init (struct dequant_factors factors [MAX_MB SEGMENTS], 
const struct vp8_segment_hdr *seg, 
const struct vp8_quant_hdr *quant hdr) 


int i, a; 
struct dequant factors *dgf - factors; 


for (i = 0; i < (seg->enabled ? MAX MB SEGMENTS : 1); i++) 
{ 
q = quant_hdr->q_index; 


if (seg->enabled) 
q = (!seg->abs) ? q + seg-»quant idx[i] 
seg->guant idx[il; 


if (dgf->guant idx != q | | quant hdr->delta update) 

dagf->factor[TOKEN BLOCK Y1][0] = 

dc g(g + quant hdr->yl dc delta g); 
dgf->factor [TOKEN BLOCK Y1][1] = 

ac gla); 
dgf->factor[TOKEN BLOCK UV][0] = 

dc q(q * quant hdr->uv dc delta g); 
dgf->factor [TOKEN BLOCK UV][1] = 

ac g(g + quant hdr->uv ac delta da); 
dgf->factor[TOKEN BLOCK Y2][0] = 

dc g(g + quant hdr->y2 dc delta ag) " 2; 
dgf->factor[TOKEN BLOCK Y2][1] = 

ac q(q + quant hdr->y2 ac delta q) * 155 / 100; 


if (dgf->factor[TOKEN BLOCK Y2][1] 
dqf-»factor[TOKEN BLOCK Y2][1] 


^ 


8) 
8; 
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if (dgf->factor[TOKEN BLOCK UV][0] > 132) 
dqf-»factor[TOKEN BLOCK UV][0] = 132; 


dgf->guant idx = a; 
} 


dagftt; 


static void 

decode frame(struct vp8 decoder ctx "ctx, 
const unsigned char *data, 
unsigned int sz) 


vpx codec err t res; 
struct bool decoder bool; 
int i, row, partition; 


ctx->saved entropy valid < 0; 
if ((res = vp8 parse frame header (data, sz, €ctx->frame_hdr))) 


vpx internal error(&ctx-»error, res, 
"Failed to parse frame header"); 


if (ctx->frame hdr.is experimental) 
vpx internal error(&ctx-»error, VPX CODEC UNSUP BITSTREAM, 
"Experimental bitstreams not supported."); 


data += FRAME HEADER SZ; 
sz -= FRAME HEADER SZ; 


if (ctx->frame hdr.is keyframe) 


( 


data += KEYFRAME HEADER SZ; 


sz -= KEYFRAME HEADER SZ; 
ctx-»mb cols = (ctx->frame hdr.kf.w + 15) / 16; 
ctx-»mb rows = (ctx->frame hdr.kf.h + 15) / 16; 


} 


/* Start the bitreader for the header/entropy partition */ 
init_bool_decoder(&bool, data, ctx->frame hdr.part0 sz); 
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/* Skip the colorspace and clamping bits */ 
if (ctx->frame hdr.is keyframe) 
if (bool get uint(sbool, 2)) 
vpx internal error( 
&ctx-»error, VPX CODEC UNSUP BITSTREAM, 
"Reserved bits not supported."); 


decode segmentation header(ctx, &bool, &ctx-»segment hdr); 
decode loopfilter header(ctx, &bool, &ctx-»loopfilter hdr); 
decode and init token partitions (ctx, 
&bool, 
data + ctx->frame hdr.part0 sz, 
sz — ctx->frame hdr.part0 sz, 
&ctx-»token hdr); 
decode quantizer header(ctx, &bool, &ctx-»quant hdr); 
decode reference header(ctx, &bool, šctx->reference hdr); 


/* Set keyframe entropy defaults. These get updated on keyframes 
* regardless of the refresh entropy setting. 
"y 

if (ctx->frame hdr.is keyframe) 

{ 

ARRAY COPY(ctx-»entropy hdr.coeff probs, 
k default coeff probs); 

ARRAY COPY(ctx-»entropy hdr.mv probs, 
k default mv probs); 

ARRAY COPY(ctx-»entropy hdr.y mode probs, 
k default y mode probs); 

ARRAY COPY(ctx-»entropy hdr.uv mode probs, 
k default uv mode probs); 


} 


if (!ctx-»reference hdr.refresh entropy) 

{ 
ctx->saved entropy = ctx-»entropy hdr; 
ctx->saved entropy valid = 1; 


} 
decode entropy header(ctx, &bool, &ctx-»entropy hdr); 


vp8 dixie modemv init (ctx); 

vp8 dixie tokens init(ctx); 

vp8 dixie predict init(ctx); 

dequant init(ctx->deguant factors, &ctx-»segment hdr, 
&ctx-»quant hdr); 
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for (row < 0, partition < 0; row < ctx-»mb rows; rowtt) 
{ 
vp8 dixie modemv process row( 
ctx, &bool, row, 0, ctx-»mb cols); 
vp8 dixie tokens process row(ctx, partition, row, O0, 
ctx-»mb cols); 
vp8 dixie predict process row(ctx, row, 0, ctx-»mb cols); 


if (ctx->loopfilter hdr.level && row) 
vp8 dixie loopfilter process row(ctx, row - 1, 0, 
ctx->mb cols); 


if (++partition == ctx->token hdr.partitions) 
partition < 0; 


if (ctx->loopfilter hdr.level) 
vp8 dixie loopfilter process row( 
ctx, row = 1, 0, ctx->mb cols); 


ctx->frame cnttt; 


if (!ctx->reference hdr.refresh entropy) 
ctx->entropy hdr = ctx->saved entropy; 
ctx->saved entropy valid = 0; 


} 


/* Handle reference frame updates */ 
if (ctx->reference hdr.copy arf == 1) 
{ 
vp8 dixie release ref frame(ctx->ref frames[ALTREF FRAME]); 
ctx->ref frames[ALTREF FRAME] < 
vp8 dixie ref frame(ctx->ref frames[LAST FRAME]); 
} 
else if (ctx->reference hdr.copy arf == 2) 
{ 
vp8 dixie release ref frame(ctx->ref frames[ALTREF FRAME]); 
ctx->ref frames[ALTREF FRAME] < 
vp8 dixie ref frame(ctx->ref frames[GOLDEN FRAME]); 
} 


if (ctx->reference hdr.copy gf == 1) 
{ 
vp8 dixie release ref frame(ctx->ref frames[GOLDEN FRAME]); 
ctx->ref frames[GOLDEN FRAME] < 
vp8 dixie ref frame(ctx->ref frames[LAST FRAME]); 
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else if (ctx->reference hdr.copy gf == 2) 

{ 
vp8 dixie release ref frame(ctx->ref frames[GOLDEN FRAME]); 
ctx->ref frames[GOLDEN FRAME] < 


vp8 dixie ref frame(ctx->ref frames[ALTREF FRAME]); 
} 


if (ctx->reference hdr.refresh gf) 
vp8 dixie release ref frame(ctx->ref frames[GOLDEN FRAME]); 
ctx->ref frames[GOLDEN FRAME] < 


vp8 dixie ref frame(ctx->ref frames[CURRENT FRAME]); 
} 


if (ctx->reference_hdr.refresh_arf) 
{ 


vp8 dixie release ref frame(ctx->ref frames[ALTREF FRAME]); 
ctx->ref frames[ALTREF FRAME] < 


vp8 dixie ref frame(ctx->ref frames[CURRENT FRAME]); 
} 


if (ctx->reference hdr.refresh last) 
{ 


vp8 dixie release ref frame(ctx-»ref frames[LAST FRAME]); 
ctx->ref frames[LAST FRAME] - 


vp8 dixie ref frame(ctx->ref frames[CURRENT FRAME]); 


void 
vp8 dixie decode init(struct vp8 decoder ctx *ctx) 
{ 

deguant global init(ctx->deguant factors); 


} 
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#define CHECK FOR UPDATE(lval,rval,update flag) do {\ 
unsigned int old = lval; \ 
update flag |= (old != (lval = rval)); N 
} while (0) 


vpx_codec_err_t 

vp8 parse frame header(const unsigned char *data, 
unsigned int SZ, 
struct vp8 frame hdr  *hdr) 


unsigned long raw; 


if (sz « 10) 
return VPX CODEC CORRUPT FRAME; 


/* The frame header is defined as a three-byte little endian 


* value 

XJ. 
raw = data[0] | (data[1] << 8) | (data[2] << 16); 
hdr->is keyframe = IBITS GET(raw, 0, 1); 
hdr->version — BITS GET(raw, 1, 2); 
hdr->is experimental = BITS GET(raw, 3, 1); 
hdr->is shown — BITS GET(raw, 4, 1); 
hdr-»partO0 sz = BITS GET(raw, 5, 19); 


if (sz <= hdr-»partO0 sz + (hdr->is keyframe ? 10 : 3)) 
return VPX CODEC CORRUPT FRAME; 


hdr->frame size updated < 0; 
if (hdr->is keyframe) 
unsigned int update = 0; 


/* Keyframe header consists of a three-byte sync code 


2011 


* followed by the width and height and associated scaling 


* factors. 


Ef 


if (data[3] != 0x9d || data[4] != 0x01 || data[5] != 0x2a) 


return VPX_CODEC_UNSUP_BITSTREAM; 
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raw = data[6] | (data[7] << 8) 
| (data[8] << 16) | (data[9] << 24); 

CHECK_FOR_UPDATE (hdr->kf.w, BITS_GET (raw, 
update) ; 

CHECK_FOR_UPDATE (hdr->kf.scale_w, BITS_GET (raw, 
update) ; 

CHECK FOR UPDATE (hdr->kf.h, BITS GET (raw, 
update); 

CHECK FOR UPDATE (hdr->kf.scale h, BITS_GET (raw, 
update); 


hdr->frame size updated = update; 


if (!hdr-»kf.w || !hdr->kf.h) 
return VPX CODEC UNSUP BITSTREAM; 


return VPX CODEC OK; 


Vpx codec err t 
vp8 dixie decode frame(struct vp8 decoder ctx *ctx, 


const unsigned char *data, 
unsigned int Sz) 
{ 
volatile struct vp8_decoder_ctx *ctx_ = ctx; 


ctx-»error.error code = VPX CODEC OK; 
ctx->error.has detail - 0; 


if 


(! set jmp (ctx->error.jmp)) 
decode frame(ctx, data, sz); 


return ctx ->error.error code; 


void 


vp8 dixie decode destroy(struct vp8 decoder ctx *ctx) 


{ 


vp8 dixie predict_destroy (ctx); 
vp8 dixie tokens destroy(ctx); 
vp8 dixie modemv destroy (ctx); 


} 


---- End code block 
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20.5. dixie.h 


.-c-o—UBegan:code block === 22093222 SH == 


/* 

* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

* found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 

#ifndef DIXIE H 

#define DIXIE H 

finclude "vpx codec internal.h" 
finclude "bool decoder.h" 


struct vp8 frame hdr 


{ 


unsigned int is_keyframe; /* Frame is a keyframe */ 
unsigned int is_experimental; /* Frame is a keyframe */ 
unsigned int version; /* Bitstream version */ 

unsigned int is_shown; /* Frame is to be displayed. */ 
unsigned int parto0 sz; /* Partition 0 length, in bytes */ 


struct vp8 kf hdr 
{ 


unsigned int w; /* Width */ 

unsigned int h; /* Height */ 

unsigned int scale w; /* Scaling factor, Width */ 

unsigned int scale h; /* Scaling factor, Height */ 
) kf; 


unsigned int frame size updated; /* Flag to indicate a resolution 
* update. 
*/ 
y; 


enum 

1 
MB_FEATURE_TREE_PROBS 
MAX_MB_SEGMENTS = 4 


ll 
w 
~ 


y; 
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struct vp8 segment hdr 


( 


unsigned int enabled; 
unsigned int update data; 
unsigned int update map; 
unsigned int abs; /* O=deltas, l=absolute values */ 
unsigned int tree probs[MB FEATURE TREE PROBS]; 
int lf level[MAX MB SEGMENTS]; 
int quant idx[MAX MB SEGMENTS]; 
y; 
enum 


1 
BLOCK_CONTEXTS = 4 


y; 


struct vp8_loopfilter_hdr 
{ 


unsigned int use_simple; 

unsigned int level; 

unsigned int sharpness; 

unsigned int delta_enabled; 

int ref delta[BLOCK CONTEXTS]; 

int mode delta[BLOCK CONTEXTS]; 
y 
enum 


1 
MAX_PARTITIONS = 8 


y; 


struct vp8 token hdr 
{ 

unsigned int partitions; 

unsigned int partition sz[MAX PARTITIONS]; 
y 
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struct vp8 quant hdr 


{ 
unsigned 
int 
int 
int 
int 
int 
int 


y; 


int 


q index; 

delta update; 
yl dc delta dg; 
y2 dc delta dg; 
y2 ac delta dg; 
uv dc delta dg; 
uv ac delta dg; 


struct vp8 reference hdr 


unsigned 
unsigned 
unsigned 
unsigned 
unsigned 
unsigned 
unsigned 


enum 


int 
int 
int 
int 
int 
int 
int 


BLOCK TYPES 
PREV COEFF CONTEXTS < 3, 
COEFF BANDS = 

ENTROPY NODES = 11, 


y; 


refresh_last; 
refresh_gf; 
refresh_arf; 
copy. gf; 

copy arf; 

sign bias[4]; 
refresh entropy; 


typedef unsigned char coeff probs table t[BLOCK TYPES] [COEFF BANDS] 
[PREV COEFF CONTEXTS] 
[ENTROPY NODES]; 


enum 


{ 


MV_PROB_CNT 


y; 


=2 + 8 - 1 + 10 /* from entropymv.h */ 


typedef unsigned char mv component probs t[MV PROB CNT]; 
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struct vp8 entropy hdr 

{ 
coeff_probs_table_t coeff_probs; 
mv component probs t mv probs[2]; 


unsigned int coeff skip enabled; 
unsigned char coeff skip prob; 
unsigned char y mode probs[4]; 
unsigned char uv mode probs[3]; 
unsigned char prob inter; 
unsigned char prob last; 
unsigned char prob gf; 


y; 


enum reference frame 
{ 
CURRENT_FRAME, 
LAST_FRAME, 
GOLDEN_FRAME, 
ALTREF_FRAME, 
NUM REF FRAMES 


y; 


enum prediction_mode 


{ 
/* 16x16 intra modes */ 
DC PRED, V PRED, H PRED, TM PRED, B PRED, 


/* 16x16 inter modes */ 
NEARESTMV, NEARMV, ZEROMV, NEWMV, SPLITMV, 


MB MODE COUNT, 


/* 4x4 intra modes */ 


B DC PRED = 0, B TM PRED, B VE PRED, B HE PRED, 


November 2011 


B LD PRED, 


B RD PRED, B VR PRED, B VL PRED, B HD PRED, B HU PRED, 


/* 4x4 inter modes */ 
LEFT4X4, ABOVE4X4, ZERO4X4, NEW4X4, 


B MODE COUNT 
y; 
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enum splitmv partitioning 
{ 

SPLITMV 16X8, 

SPLITMV 8X16, 

SPLITMV 8X8, 

SPLITMV 4X4 
y 


typedef short filter t[6]; 


typedef union mv 
{ 

struct 

{ 

intl6 t x, y; 

) d; 

uint32 t raw; 
) mv t; 


struct mb base info 

{ 
unsigned char y_mode 4; 
unsigned char uv_mode 4; 
unsigned char segment_id : 2; 
unsigned char ref_frame 2 
unsigned char skip_coeff 1 


, 


, 


unsigned char need mo border : 1; 
enum splitmv partitioning partitioning : 2; 
union mv mv; 


unsigned int  eob mask; 
y; 


struct mb_info 


{ 


struct mb_base_info base; 


union 
{ 
union mv mvs[16]; 
enum prediction_mode modes[16]; 
} split; 


y; 
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/* A "token entropy context" has 4 Y values, 2 U, 2 V, and 1 Y2 */ 
typedef int token entropy ctx t[4 + 2 + 2 + 1]; 


struct token decoder 

( 
struct bool decoder bool; 
token entropy ctx t left token entropy ctx; 
short *coeffs; 

y; 


enum token block type 
{ 
TOKEN_BLOCK_Y1, 
TOKEN_BLOCK_UV, 
TOKEN_BLOCK_Y2, 
TOKEN_BLOCK_TYPES, 
y; 


struct dequant_factors 


{ 
int quant_idx; 
short factor[TOKEN_BLOCK_TYPES][2]; /* [ Yl, UV, Y2 ] 
* 


y; 


struct ref_cnt_img 

1 
vpx_image_t img; 
unsigned int ref_cnt; 


y; 


struct vp8_decoder_ctx 


{ 


struct vpx_internal_error_info error; 


unsigned int frame_cnt; 

struct vp8_frame_hdr frame_hdr; 

struct vp8_segment_hdr segment_hdr; 

struct vp8_loopfilter_hdr loopfilter_hdr; 
struct vp8_token_hdr token_hdr; 

struct vp8_quant_hdr quant_hdr; 

struct vp8_reference_hdr reference_hdr; 
struct vp8_entropy_hdr entropy_hdr; 

struct vp8_entropy_hdr saved_entropy; 
unsigned int saved entropy valid; 
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unsigned int mb rows; 
unsigned int mb cols; 
struct mb info *mb info storage; 
struct mb info **mb info rows storage; 
struct mb info **mb info rows; 
token entropy ctx t *above token entropy ctx; 
struct token decoder tokens [MAX PARTITIONS]; 
struct dequant factors dequant factors[MAX MB SEGMENTS]; 
struct ref cnt img frame strg[NUM REF FRAMES]; 
struct ref cnt img *ref frames[NUM REF FRAMES]; 
ptrdiff t ref frame offsets[4]; 
const filter t *subpixel filters; 

y; 

void 


vp8 dixie decode init(struct vp8 decoder ctx *ctx); 


void 
vp8 dixie decode destroy(struct vp8 decoder ctx *ctx); 


Vpx codec err t 

vp8 parse frame header(const unsigned char *data, 
unsigned int SZ, 
struct vp8 frame hdr  *hdr); 


Vpx codec err t 
vp8 dixie decode frame(struct vp8 decoder ctx *ctx, 


const unsigned char *data, 
unsigned int sz); 
#define CLAMP 255(x) ((x)<0?0:((x)>255?255:(x))) 


Hendif 


=== Eno: ¿sae block Ss sss 5 SSS SSeS SSS Ss === 92 === PE 
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20.6. 


dixie loopfilter.c 


smmm Begin code block === 20932122 SS 32 


/ 


+ + X Xo DR Ro Xo Xo F 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


finclude "dixie.h" 
finclude "dixie loopfilter.h" 


#define ABS(x) ((x) >= 0 ? (x) : -(x)) 


#define p3 pixels[-4"stride] 
#define p2 pixels[-3*stride] 
#define pl pixels[-2*stride] 
#define p0 pixels[-l"stride] 
#define q0 pixels[ O*stride] 
#define gl pixels[ l"stride] 
#define q2 pixels[ 2*stride] 
#define q3 pixels[ 3*stride] 


#define static 
static int 


saturate_int8(int x) 


{ 


if (x < -128) 
return -128; 


if (x > 127) 
return 127; 


return x; 
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static int 
saturate uint8(int x) 
{ 
if (x < 0) 
return 0; 


if (x > 255) 
return 255; 


return x; 


static int 
high_edge_variance (unsigned char *pixels, 
int stride, 
int hev_threshold) 


return ABS(pl - p0) > hev threshold | | 
ABS (q1 - q0) > hev threshold; 


static int 
simple threshold(unsigned char "pixels, 
int stride, 
int filter limit) 


return (ABS(p0- q0) * 2 + (ABS(pl - gl) >> 1)) <= filter limit; 


static int 
normal threshold(unsigned char "pixels, 


int stride, 
int edge limit, 
int interior limit) 


int E < edge limit; 
int I < interior limit; 


return simple threshold( 
&& ABS(p3 - p2) < 
&& ABS(pl - p0) <= 
&& ABS(q2 - gl) << 


pixels, stride, 2 * 
= I && ABS(p2 - pl) <= 
I && ABS(q3 - q2) 
I && ABS(gl - q0) 


Bankoski, et al. Informational [Page 159] 


RFC 6386 


static 


VP8 Data Format and Decoding Guide 


void 
filter common (unsigned char "pixels, 
int stride, 
int use outer taps) 


if 


Bankoski, 


= 3 * (q0 — p0); 


(use_outer_taps) 
a += saturate int8(pl - gl); 


— saturate int8 (a); 


((a + 4 > 127) ? 127 : a + 4) >> 3; 
= ((a + 3 > 127) ? 127 : a + 3) >> 3; 


= saturate uint8(p0 + f2); 
— saturate uint8(g0 - fl); 


(!use outer taps) 


/* This handles the case of subblock filter() 
* (from the bitstream guide. 


el 
a = (fl + 1) >> 1; 
pl = saturate uint8 (pl + a); 
ql = saturate uint8 (ql - a); 
et al. Informational 


November 2011 


[Page 160] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


static void 

filter mb edge(unsigned char *pixels, 
int stride) 

{ 


int w, a; 


= 
ll 


saturate int8(saturate int8(pl - ql) + 3 * (q0 - p0)); 


a = (27 % w + 63) >> 7; 
pO = saturate uint8(p0 + a); 
q0 = saturate uint8(g0 - a); 


a = (18 * w+ 63) >> 7; 
pl = saturate uint8 (pl + a); 
gl = saturate uint8(gl ; 


| 
w 


a= (9 * w+ 63) >> 7; 
p2 = saturate uint8(p2 + a); 
q2 = saturate uint8(a2 ; 


| 
o 


static void 
filter mb v edge(unsigned char "src, 


int stride, 
int edge limit, 
int interior limit, 
int hev threshold, 
int size) 
{ 
int 1; 
for (i = 0; i < 8 * size; i++) 


if (normal threshold(src, 1, edge limit, interior limit)) 


if (high edge variance(src, 1, hev threshold)) 
filter common(src, 1, 1); 
else 
filter mb edge(src, 1); 
} 


src += stride; 
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static void 
filter subblock v edge(unsigned char *src, 


int stride, 
int edge limit, 
int interior limit, 
int hev threshold, 
int size) 

i; 

(i = 0; i < 8 * size; i++) 


if (normal_threshold(src, 1, edge_limit, interior_limit)) 


filter common(src, 1, 


src += stride; 


static void 
filter_mb_h_edge(unsigned char *src, 


Bankoski, 


high_edge_variance(src, 1, 


int stride, 
int edge_limit, 
int interior_limit, 
int hev_threshold, 
int size) 

i; 

(i = 0; i < 8 * size; i++) 


if (normal threshold(src, stride, edge limit, 


interior limit)) 


November 
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hev threshold)); 


if (high edge variance(src, stride, hev threshold)) 
filter common(src, stride, 1); 


else 


filter mb edge(src, stride); 


src += 1; 
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static void 
filter subblock h edge(unsigned char *src, 


int stride, 
int edge limit, 
int interior limit, 
int hev threshold, 
int size) 
{ 
int. i; 
for (i = 0; i < 8 * size; i++) 


if (normal threshold(src, stride, edge limit, 
interior limit)) 
filter common(src, stride, 
high edge variance(src, stride, 
hev threshold)); 


src += 1; 


static void 
filter_v_edge_simple(unsigned char *src, 
int stride, 
int filter limit) 


for (i = 0; i < 16; i++) 


if (simple threshold(src, 1, filter limit)) 
filter common(src, 1, 1); 


src += stride; 
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static void 
filter h edge simple(unsigned char "src, 
int stride, 
int filter limit) 


for (i = 0; i < 16; i++) 


if (simple threshold(src, stride, filter limit)) 
filter common(src, stride, 1); 


src += 1; 


static void 
calculate filter parameters(struct vp8 decoder ctx *ctx, 


2011 


struct mb info *mbi, 

int *edge limit , 

int *interior limit , 
int *hev threshold ) 


int filter level, interior limit, hev threshold; 


/* Reference code/spec seems to conflate filter level and 
* edge limit 


*/ 
filter level = ctx->loopfilter hdr.level; 


if (ctx-»segment hdr.enabled) 
if (!ctx->segment hdr.abs) 
filter level += 
ctx->segment hdr.lf level[mbi->base.segment id]; 
else 
filter level - 
ctx->segment hdr.lf level[mbi->base.segment id]; 


} 


if (filter level > 63) 
filter level - 63; 
else if (filter level « 0) 
filter level < 0; 
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if (ctx->loopfilter hdr.delta enabled) 
{ 
filter level += 
ctx->loopfilter hdr.ref delta[mbi->base.ref frame]; 


if (mbi->base.ref frame == CURRENT FRAME) 
{ 

if (mbi->base.y_mode == B_PRED) 

filter level += ctx->loopfilter hdr.mode delta[0]; 

} 
else if (mbi->base.y mode == ZEROMV) 

filter level += ctx->loopfilter hdr.mode delta[1]; 
else if (mbi->base.y mode == SPLITMV) 

filter level += ctx->loopfilter hdr.mode delta[3]; 
else 

filter level += ctx->loopfilter hdr.mode delta[2]; 

} 


if (filter_level > 63) 
filter_level = 63; 
else if (filter_level < 0) 
filter_level = 0; 


interior_limit = filter_level; 


if (ctx->loopfilter hdr.sharpness) 


interior limit >>= ctx->loopfilter hdr.sharpness > 4 ? 2 : 1; 
if (interior limit > 9 - ctx->loopfilter hdr.sharpness) 
interior limit < 9 - ctx->loopfilter hdr.sharpness; 


} 


if (interior limit < 1) 
interior limit < 1; 


hev threshold < (filter level >< 15); 


if (filter level »- 40) 
hev thresholdtt; 


if (filter level >= 20 && !ctx-»frame hdr.is keyframe) 
hev thresholdtt; 


*edge limit  -» filter level; 
*interior limit  - interior limit; 
*hev threshold = hev threshold; 
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static void 
filter row normal(struct vp8 decoder ctx *ctx, 


unsigned int row, 
unsigned int start col, 
unsigned int num cols) 


unsigned char *y, *u, "v; 


int stride, uv stride; 
struct mb info *mbi; 
unsigned int col; 


/* Adjust pointers based on row, start col */ 


stride = ctx->ref frames[CURRENT FRAME]->img.stride[PLANE Y]; 
uv stride = ctx->ref frames[CURRENT FRAME]->img.stride[PLANE U]; 
y = ctx-»ref frames[CURRENT FRAME]->img.planes[PLANE Y]; 
u = ctx-»ref frames[CURRENT FRAME]-»img.planes[PLANE U]; 
v — ctx->ref frames[CURRENT FRAME]->img.planes[PLANE V]; 
y += (stride * row + start col) * 16; 
u 
v 


+= (uv_stride * row + start_col) * 8; 
+= (uv_stride * row + start_col) * 8; 
mbi = ctx-»mb info rows[row] + start col; 


for (col = start col; col « start col + num cols; col++) 
int edge limit, interior limit, hev threshold; 


/* TODO: Only need to recalculate every MB if segmentation is 
* enabled. 
El 
calculate filter parameters (ctx, mbi, &edge limit, 
&interior limit, &hev threshold); 


if (edge limit) 
{ 
if (col) 
{ 
filter_mb_v_edge(y, stride, edge_limit + 2, 
interior_limit, hev_threshold, 2); 
filter_mb_v_edge(u, uv_stride, edge_limit + 2, 
interior_limit, hev_threshold, 1); 
filter_mb_v_edge(v, uv_stride, edge_limit + 2, 
interior_limit, hev_threshold, 1); 
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/* NOTE: 


* number of coefficients decoded, 
* coded in the bitstream. 


November 2011 


This conditional is actually dependent on the 


not the skip flag as 
The tokens task is expected 


* to set 31 if there is *any* non-zero data. 


E 
if (mbi->base.eob mask 
|| mbi->base.y mode -- SPLITMV 
| | mbi->base.y_mode == B_PRED) 
{ 
filter_subblock_v_edge(y + 4, stride, edge_limit, 
interior_limit, hev_threshold, 
2); 
filter subblock v edge(y * 8, stride, edge limit, 
interior limit, hev threshold, 
2); 
filter subblock v edge(y t 12, stride, edge limit, 
interior limit, hev threshold, 
2); 
filter subblock v edge(u * 4, uv stride, edge limit, 
interior limit, hev threshold, 
1); 
filter subblock v edge(v * 4, uv stride, edge limit, 
interior limit, hev threshold, 
1); 
} 
if (row) 
{ 
filter_mb_h_edge(y, stride, edge_limit + 2, 
interior_limit, hev_threshold, 2); 
filter_mb_h_edge(u, uv_stride, edge_limit + 2, 
interior_limit, hev_threshold, 1); 
filter_mb_h_edge(v, uv_stride, edge_limit + 2, 
interior_limit, hev_threshold, 1); 
} 
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} 


if (mbi->base.eob_mask 
|| mbi-»base.y mode == SPLITMV 
|| mbi->base.y mode == B PRED) 


filter subblock h edge(y * 4 * stride, stride, 
edge limit, interior limit, 
hev threshold, 2); 

filter subblock h edge(y t 8 * stride, stride, 
edge limit, interior limit, 
hev threshold, 2); 

filter subblock h edge(y * 12 * stride, stride, 
edge limit, interior limit, 
hev threshold, 2); 

filter subblock h edge(u + 4 * uv stride, uv stride, 
edge limit, interior limit, 
hev threshold, 1); 

filter subblock h edge(v * 4 * uv stride, uv stride, 
edge limit, interior limit, 
hev threshold, 1); 


static void 
filter row simple(struct vp8 decoder ctx *ctx, 


unsigned int row, 
unsigned int start col, 
unsigned int num cols) 


unsigned char  *y; 


int stride; 
struct mb info *mbi; 
unsigned int col; 


/* Adjust pointers based on row, start col */ 

stride = ctx->ref frames[CURRENT FRAME]->img.stride[PLANE Y]; 
y = ctx->ref frames[CURRENT FRAME]->img.planes[PLANE Y]; 

y += (stride * row + start col) * 16; 

mbi < ctx->mb info rows[row] t start col; 
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(col = start col; col < start col + num cols; col+t) 


int edge_limit, interior_limit, hev_threshold; 


/* TODO: Only need to recalculate every MB if segmentation is 


* enabled. 


*/ 


calculate filter parameters(ctx, mbi, &edge limit, 


if 
{ 


&interior limit, &hev_threshold) ; 


(edge_limit) 


/* NOTE: This conditional is actually dependent on the 
* number of coefficients decoded, not the skip flag as 
* coded in the bitstream. The tokens task is expected 
* to set 31 if there is *any* non-zero data. 


* 


int filter subblocks = (mbi->base.eob mask 
|| mbi->base.y mode == SPLITMV 
| | mbi->base.y_mode == B_PRED); 
int mb limit = (edge limit + 2) * 2 + interior limit; 
int b limit = edge limit * 2 + interior limit; 
if (col) 


filter v edge simple(y, stride, mb limit); 


if (filter. subblocks) 
filter v edge simple(y t 4, stride, b limit); 
filter v edge simple(y t 8, stride, b limit); 
filter v edge simple(y t 12, stride, b limit); 
} 


if (row) 
filter_h_edge_simple(y, stride, mb_limit); 


if (filter_subblocks) 
{ 
filter_h_edge_simple(y + 4 * stride, stride, 


b_limit); 
filter_h_edge_simple(y + 8 * stride, stride, 
b_limit); 
filter_h_edge_simple(y + 12 * stride, stride, 
b_limit); 
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y += 16; 
mbitt; 
} 
} 
void 
vp8_dixie_loopfilter_process_row(struct vp8_decoder_ctx *ctx, 
unsigned int row, 
unsigned int start_col, 
unsigned int num_cols) 


if (ctx->loopfilter_hdr.use_simple) 
filter_row_simple(ctx, row, start_col, num_cols); 
else 
filter_row_normal (ctx, row, start_col, num_cols); 


} 
=> “ana code DIOOk 3235242 POR RO ee 
20.7. dixie loopfilter.h 


--s- Begin code block ===2====>32=>532=32++22S22 +32 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 
#ifndef DIXIE LOOPFILTER H 
#define DIXIE LOOPFILTER H 


void 

vp8 dixie loopfilter process row(struct vp8 decoder ctx *ctx, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols); 

fendif 


scum ENA Code DVA OK o Ri oa iN aii 
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20.8.  idct add.c 


smmm Begin codé block === 220932225 == 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
x tree. An additional intellectual property rights grant can be 

* found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

x7 


#include "dixie.h" 
#include "idct_add.h" 
#include <assert.h> 


void 
vp8_dixie_walsh (const short *input, short *output) 
{ 

int i; 

int al, bl, Cl, al; 

int a2, b2, c2, a2; 

const short "ip = input; 

short "op = output; 


Il 
e 
H 
^ 
A 
H- 
+ 
+ 


for (i 
{ 
al = ip[0] + ip[12]; 
bl = ip[4] + ipI8]; 


cl = ip[4] - ipl8]; 
dl < ip[0] - ip[12]; 
op[0] = al + bl; 
op[4] = cl + dl; 
op[8] = al - bl; 
op[12] = dl - cl; 
iptt; 
optt; 

} 

ip = output; 

op = output; 
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for (i = 0; i < 4; i++) 


al = ip[0] + ip[3]; 
bl = ip[1] + ip[2]; 
cl = ip[1] - ip[21; 
dl = ip[0] - ip[31; 


a2 = al + bl; 
b2 = cl 
c2 = al - bl; 
d2 = dl — cl; 


+ 
O, 
E 


op[0] = (a2 + 3) >> 3; 
opll] < (b2 + 3) >> 3; 
op[2] < (c2 t 3) >> 3; 
op[3] < (d2 t 3) >> 3; 
ip += 4; 
op += 4; 


#define cospi8sqrt2minusl 20091 
#define sinpi8sqrt2 35468 
#define rounding 0 
static void 
idct_columns(const short *input, short *output) 
{ 
int i; 
int al, bl, cl, dl; 


const short "ip = input; 
short *op - output; 

int templ, temp2; 

int shortpitch < 4; 


for (i = 0; i < 4; i++) 
{ 
al = ip[0] + ip[8]; 
bl = ip[0] - ipI81; 


templ = (ip[4] * sinpi8sqrt2 + rounding) >> 16; 
temp2 = ip[12] + 

((ip[12] * cospi8sgrt2minusl + rounding) >> 16); 
cl = templ - temp2; 
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templ < ip[4] t 

((ip[4] * cospi8sqrt2minusl + rounding) >> 16); 
temp2 = (ip[12] * sinpi8sqrt2 + rounding) >> 16; 
dl = templ + temp2; 


op[shortpitch*0] = al + dl; 
op[shortpitch*3] < al - dl; 


oplshortpitch"1] = bl + cl; 
op[shortpitch*2] = bl - cl; 


iptt; 
optt; 
} 
} 
void 
vp8 dixie idct add(unsigned char *recon, 
const unsigned char  *predict, 
int stride, 
const short *coeffs) 
{ 
int i; 


int al, bl, cl, dl, templ, temp2; 
short tmp[16]; 
idct_columns(coeffs, tmp); 

coeffs = tmp; 


for (i = 0; i < 4; itt) 

{ 
al = coeffs[0] + coeffs[2]; 
bl = coeffs[0] - coeffs[2]; 


templ = (coeffs[1] * sinpi8sqrt2 + rounding) >> 16; 
temp2 = coeffs[3] + 

((coeffs[3] * cospi8sgrt2minusl + rounding) >> 16); 
cl = templ - temp2; 


templ = coeffs[1] + 

((coeffs[1] * cospi8sqrt2minusl + rounding) >> 16); 
temp2 = (coeffs[3] * sinpi8sqrt2 + rounding) >> 16; 
dl = templ + temp2; 


recon[0] = CLAMP 255(predict[0] + ((al + dl + 4) >> 3)); 
recon[3] = CLAMP 255(predict[3] + ((al - dl + 4) >> 3)); 
recon[1] = CLAMP 255(predict[1] + ((bl + cl + 4) >> 3)); 
recon[2] = CLAMP 255(predict[2] + ((bl - cl + 4) >> 3)); 
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} 


coeffs += 4; 
recon += stride; 
predict += stride; 


sea End code DlOCk, =ssSsS—SSsSssSs S55 Snes SSS SSH SS SSS = E 


20.9. 


idct_add.h 


---s— Begin code block ==-===2=====5=====3====25252925=252====== 


7 


o a oo ob Sb ob obSOX* 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


#ifndef IDCT ADD H 
#define IDCT ADD H 


void 
vp8 dixie idct add init(struct vp8 decoder ctx *ctx); 


void 

vp8 dixie idct add(unsigned char *recon, 
const unsigned char  *predict, 
int stride, 
const short *coeffs); 

void 


vp8 dixie walsh(const short *in, short *out); 


void 


vp8 dixie idct add process row(struct vp8 decoder ctx *ctx, 
short *coeffs, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols); 


#endif 


===>) JBNnd Code block —--——-e-92--e5--50-2--eopu-uescececoeccsccpo 
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20.10. 


Bankoski, 


mem.h 


+==="Begin code block -2--------2-5- 


/ 


Copyright (c) 2010, 2011, 


tree. 
found in the file PATENTS. 


+ + X Xo + F F Xo F 


/ 

#ifndef VPX PORTS MEM H 
#define VPX PORTS MEM H 
finclude "vpx config.h" 
finclude "vpx integer.h" 


#if defined(__GNUC__) && GNUC 

#define DECLARE ALIGNED (n,typ,val) 
((aligned (n))) 

#elif defined( MSC VER) 

#define DECLARE ALIGNED (n,typ,val) 

#else 


VP8 Data Format and Decoding Guide 


Google Inc. 


November 2011 


All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
An additional intellectual property rights grant can be 
All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


typ val __attribute__ \ 


. declspec(align(n)) typ val 


fwarning No alignment directives known for this compiler. 


#define DECLARE ALIGNED (n, typ, val) 
#endif 
#endif 


/* Declare an aligned array on the stack, 
* stack pointer may not have the alignment we expect. 
then defines val to be a pointer, 


* array with a modified name, 


typ val 


for situations where the 
Creates an 
and 


* aligns that pointer within the array. 


Ria 


#define DECLARE ALIGNED ARRAY (a,typ,val,n)A 


typ valHH [(n)t(a) /sizeof (typ) tl];N 
(typ*) ((((intptr t)vald4 )-(a)-1)&((intptr t)-(a))) 


typ *val - 


et al. 
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/* Indicates that the usage of the specified variable has been 
* audited to assure that it's safe to use uninitialized.  Silences 
* 'may be used uninitialized' warnings on gcc. 


*/ 

dif defined(__GNUC__) && GNUC 
#define UNINITIALIZED IS SAFE(x) x-x 
#else 

#define UNINITIALIZED IS SAFE(x) x 
#endif 


ceecJgend-eode block —--——---92--95--80-2c-epy-uescesc-eesccp- 


20.11.  modemv.c 


-—--- Begin code block -----------5--—---o-cec-oeoccec-cesooe- 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

*4 


#include "dixie.h" 
#include "modemv data.h" 
#include <stdlib.h> 
#include <assert.h> 


struct mv_clamp_rect 


{ 
int to_left, to_right, to_top, to_bottom; 
y; 
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static union mv 
clamp mv(union mv raw, const struct mv clamp rect *bounds) 


union mv newmv; 


newmv.d.x (raw.d.x < bounds->to left) 


? bounds->to left : raw.d.x; 
newmv.d.x < (raw.d.x > bounds->to right) 
? bounds->to right : newmv.d.x; 


(raw.d.y < bounds->to top) 
? bounds->to top : raw.d.y; 
newmv.d.y < (raw.d.y > bounds->to bottom) 

? bounds->to bottom : newmv.d.y; 
return newmv; 


newmv.d.y 


static int 
read segment id(struct bool decoder "bool, 
struct vp8 segment hdr "seg) 
return bool get(bool, seg->tree probs[0]) 
? 2 + bool_get (bool, seg->tree probs[2]) 
bool get(bool, seg->tree probs[1]); 


static enum prediction mode 

above block mode(const struct mb info "this, 
const struct mb info *above, 
unsigned int b) 


if (b < 4) 
{ 
Switch (above->base.y mode) 
{ 
case DC_PRED: 
return B_DC_PRED; 
case V_PRED: 
return B_VE_PRED; 
case H_PRED: 
return B HE PRED; 
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case TM PRED: 

return B TM PRED; 
case B PRED: 

return above->split.modes[b+12]; 
default: 

assert (0); 


} 


return this->split.modes[b-4]; 


static enum prediction mode 

left block mode(const struct mb info "this, 
const struct mb info "left, 
unsigned int b) 


if (!(b & 3)) 
{ 
switch (left->base.y mode) 
{ 
case DC_PRED: 
return B DC PRED; 
case V PRED: 
return B VE PRED; 
case H PRED: 
return B HE PRED; 
case TM PRED: 
return B TM PRED; 
case B PRED: 
return left->split.modes[b+3]; 
default: 
assert (0); 


} 


return this->split.modes[b-1]; 
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static void 


decode kf mb mode(struct mb info *this, 
struct mb info *left, 
struct mb info *above, 


struct bool decoder *bool) 
int y mode, uv mode; 
y mode - bool read tree(bool, kf y mode tree, kf y mode probs); 
if (y mode -- B PRED) 
{ 


unsigned int i; 


for (i = 0; i < 16; i++) 
{ 


above block mode(this, above, 
i); 

enum prediction mode 1 = left block mode(this, left, i); 

enum prediction mode b; 


enum prediction mode a 


b < bool read tree(bool, b mode tree, 
kf b mode probs[al[11); 
this-»5split.modes[i] b; 


} 


uv_mode = bool_read_tree(bool, uv_mode_tree, kf_uv_mode_probs) ; 


this->base.y mode = y mode; 
this->base.uv mode < uv mode; 
this->base.mv.raw - 0; 


this->base.ref frame 0; 
} 
static void 
decode_intra_mb_mode(struct mb_info *this, 


struct vp8 entropy hdr *hdr, 
struct bool decoder *bool) 


/* Like decode_kf_mb_mode, but with probabilities transmitted in 
* the bitstream and no context on the above/left block mode. 
x / 


int y_mode, uv_mode; 


y_mode = bool_read_tree(bool, y_mode_tree, hdr->y_mode_probs) ; 
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if (y mode -- B PRED) 
{ 


unsigned int i; 


for (i = 0; i < 16; i++) 
{ 


enum prediction_mode b; 


b = bool_read_tree(bool, b_mode_tree, 
default_b_mode_probs) ; 
this->split.modesli] = b; 


} 
uv mode = bool read tree(bool, uv mode tree, hdr-»uv mode probs); 


this->base.y mode - y mode; 
this->base.uv mode = uv mode; 
this->base.mv.raw = 0; 


this->base.ref frame CURRENT FRAME; 


static int 
read mv component (struct bool decoder "bool, 
const unsigned char  mvc[MV PROB CNT]) 
enum (IS SHORT, SIGN, SHORT, BITS = SHORT + 8 - 1, 
LONG WIDTH < 10); 


int x < 0; 
if (bool get(bool, mvc[IS SHORT])) /* Large %/ 
{ 

int i = 0; 


for (i = 0; i < 3; itt) 
x += bool_get (bool, mvc[BITS + i]) << i; 


/* Skip bit 3, which is sometimes implicit */ 
for (i = LONG_WIDTH - 1; i > 3; i--) 
x += bool_get (bool, mvc[BITS + i]) << i; 


if (!(x € OxFFFO) || bool_get (bool, mvc[BITS + 3])) 
x += 8; 
} 
else /* small */ 
x = bool_read_tree(bool, small_mv_tree, mvc + SHORT); 
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if (x && bool get(bool, mvc[SIGN])) 
x = -X; 


return x «« 1; 


static mv t 

above block mv(const struct mb info *this, 
const struct mb info *above, 
unsigned int b) 


if (b « 4) 


{ 
if (above->base.y mode == SPLITMV) 


return above->split.mvs[b+12]; 


return above->base.mv; 


} 


return this->split.mvs[b-4]; 


static mv_t 

left block mv(const struct mb info "this, 
const struct mb info "left, 
unsigned int b) 


if (!(b & 3)) 
{ 
if (left->base.y mode == SPLITMV) 
return left->split.mvs[b+3]; 


return left->base.mv; 


} 


return this->split.mvs[b-1]; 
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static enum prediction mode 
submv ref(struct bool decoder *bool, union mv 1, union mv a) 
{ 
enum subblock_mv_ref 
{ 
SUBMVREF NORMAL, 
SUBMVREF LEFT ZED, 
SUBMVREF ABOVE ZED, 
SUBMVREF LEFT ABOVE SAME, 
SUBMVREF LEFT ABOVE ZED 


y; 


int lez = !(l.raw); 
int aez = !(a.raw); 
int lea = l.raw == a.raw; 


enum subblock_mv_ref ctx = SUBMVREF_NORMAL; 


if (lea && lez) 


ctx = SUBMVREF_LEFT_ABOVE_ZED; 
else if (lea) 

ctx = SUBMVREF LEFT ABOVE SAME; 
else if (aez) 

ctx < SUBMVREF ABOVE ZED; 
else if (lez) 

ctx = SUBMVREF LEFT ZED; 


return bool read tree(bool, submv ref tree, 
submv ref probs2[ctx]); 


static void 

read mv(struct bool decoder  "bool, 
union mv *mv, 
mv component probs t  mvc[2]) 


mv-»d.y = read mv component (bool, mvc[0]); 
mv->d.x = read mv component (bool, mvc[1]); 


Bankoski, et al. Informational [Page 182] 


RFC 6386 


static void 

mv bias(const struct mb info 
const unsigned int 
enum reference frame 
union mv 


if (sign bias[mb->base.ref frame] 


{ 
mv->d.x *= -1; 
mv-»d.y *= -1; 


enum near mv v 

{ 
CNT_BEST = 0, 
CNT_ZEROZERO = 0, 
CNT_NEAREST, 
CNT_NEAR, 
CNT_SPLITMV 

y; 


static void 


VP8 Data Format and Decoding Guide 


*mb, 

sign bias[3], 
ref frame, 
*mv) 


November 2011 


^ sign bias[ref frame]) 


find near mvs(const struct mb info *this, 
const struct mb info *left, 
const struct mb info *above, 
const unsigned int sign bias[3], 
union mv near mvs[4], 
int cnt[4]) 

{ 

const struct mb info *aboveleft = above - 1; 


union mv 
int 


*mv = 


*cntx - cnt 


/* Zero accumulators */ 


near mvs;j 


, 


mv[0].raw < mv[1].raw < mv[2].raw < 0; 
cnt[0] = cnt[1] = cnt[2] = cnt[3] = 0; 
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/* Process above */ 
if (above->base.ref frame != CURRENT FRAME) 
{ 

if (above->base.mv.raw) 


{ 


(ttmv) ->raw = above->base.mv.raw; 


November 2011 


mv bias(above, sign bias, this->base.ref frame, mv); 


++cntx; 
} 


*cntx += 2; 


} 


/* Process left */ 
if (left->base.ref frame != CURRENT FRAME) 
{ 


if (left->base.mv.raw) 


{ 


union mv this_mv; 


this mv.raw = left->base.mv.raw; 


mv bias(left, sign bias, this->base.ref frame, &this mv); 


if (this mv.raw != mv->raw) 

{ 
(++mv)->raw = this mv.raw; 
++cntx; 


} 


*cntx += 2; 
} 
else 
cnt [CNT_ZEROZERO] += 2; 
} 


/* Process above left */ 
if (aboveleft->base.ref frame != CURRENT FRAME) 
{ 


if (aboveleft->base.mv.raw) 


{ 


union mv this_mv; 


this mv.raw = aboveleft->base.mv.raw; 


mv bias(aboveleft, sign bias, this->base.ref frame, 


&this mv); 
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if (this mv.raw != mv->raw) 

{ 
(++mv)->raw = this mv.raw; 
++cntx; 


} 


*cntx += 1; 
} 
else 
cnt [CNT_ZEROZERO] += 1; 


} 
/* If we have three distinct MVs ... */ 


if (cnt[CNT SPLITMV]) 
{ 
/* See if above-left MV can be merged with NEAREST */ 
if (mv->raw == near mvs[CNT NEAREST].raw) 
cnt[CNT NEAREST] += 1; 
} 


cnt [CNT SPLITMV] = ((above->base.y mode == SPLITMV) 
+ (left->base.y mode == SPLITMV)) " 2 
+ (aboveleft->base.y mode == SPLITMV); 


/* Swap near and nearest if necessary %/ 
if (cnt[CNT NEAR] » cnt[CNT NEAREST]) 
{ 
int tmp; 
tmp = cnt[CNT NEAREST]; 
cnt[CNT NEAREST] = cnt[CNT NEAR]; 
cnt[CNT NEAR] < tmp; 
tmp = near mvs[CNT NEAREST].raw; 
near mvs[CNT NEAREST].raw < near mvs[CNT NEAR].raw; 
near mvs[CNT NEAR].raw - tmp; 
} 


/* Use near_mvs[CNT_BEST] to store the "best" MV. Note that this 
* storage shares the same address as near_mvs[CNT_ZEROZERO]. 
ay 
if (cnt[CNT NEAREST] >= cnt [CNT_BEST] ) 
near mvs[CNT BEST] = near mvs[CNT NEAREST]; 
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static void 
decode split mv(struct mb info *this, 
const struct mb info *lett, 
const struct mb info *above, 
struct vp8 entropy hdr *hdr, 
union mv best mv, 
struct bool decoder *bool) 


const int *partition; 


int 


j, k, mask, partition id; 


partition id = bool read tree(bool, split mv tree, 


split mv probs); 


partition < mv partitions[partition id]; 
this->base.partitioning = partition id; 


for 


( 


Bankoski, 


(j = 0, mask = 0; mask < 65535; j++) 


union mv mv, left mv, above mv; 
enum prediction mode subblock mode; 


/* Find the first subblock in this partition. */ 
for (k = 0; j != partition[k]; k++); 


/* Decode the next MV */ 

left mv = left block mv(this, left, k); 

above mv = above block mv(this, above, k); 

subblock mode = submv ref(bool, left mv, above mv); 


switch (subblock mode) 
1 
case LEFT4X4: 
mv = left mv; 
break; 
case ABOVE4X4: 
mv = above mv; 
break; 
case ZERO4X4: 
mv.raw = 0; 
break; 
case NEW4X4: 
read mv(bool, &mv, hdr-»mv probs); 
mv.d.x += best mv->d.x; 
mv.d.y += best mv->d.y; 
break; 
default: 
assert(0); 
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/* Fill the MVs for this partition */ 


for (; k < 16; k++) 
if (j == partition[k]) 


this->split.mvs[k] = mv; 


mask |= 1 << k; 


static int 


need mc border (union mv mv, int l, int t, int b w, int w, int h) 


{ 


int b, r; 


/* Get distance to edge for top-left pixel */ 


1 += (mv.d.x >> 3); 
t += (mv.d.y >> 3); 


/* Get distance to edge for bottom-right pixel */ 


r < w - (l + bw); 
b < h - (t + b_w); 
return (1 >> 1< 2 || r>>1<3||t>>1<2 || b>"r1<35 


} 


static void 


decode mvs(struct vp8 decoder ctx poss d 
struct mb info *this, 
const struct mb info *left, 
const struct mb info *above, 
const struct mv clamp rect *bounds, 
struct bool decoder *bool) 


struct vp8 entropy hdr *hdr = £ctx->entropy_hdr; 


union mv near mvs[4]; 
union mv clamped best mv; 
int mv cnts[4]; 
unsigned char probs[4]; 


enum (BEST, NEAREST, NEAR); 
int x, y, w, h, b; 


this->base.ref frame = bool_get (bool, hdr->prob last) 
? 2 + bool get(bool, hdr-»prob gf) 


1; 
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find near mvs(this, this - 1, above, 
ctx->reference hdr.sign bias, near mvs, mv cnts); 
probs[0] = mv counts to probs[mv cnts[0]][0]; 
probs[1] = mv counts to probs[mv cnts[1]][1]; 
probs[2] = mv counts to probs[mv cnts[211][2]; 
probs[3] = mv counts to probs[mv cnts[311[3]; 


this->base.y mode = bool read tree(bool, mv ref tree, probs); 
this->base.uv mode = this->base.y mode; 


this->base.need mc border = 0; 

x — (-bounds->to left - 128) >> 3; 
y (-bounds->to top - 128) >> 3; 
w — ctx-»mb cols * 16; 

h < ctx-»mb rows * 16; 


switch (this->base.y mode) 

{ 

case NEARESTMV: 
this->base.mv 
break; 

case NEARMV: 
this->base.mv = clamp mv (near mvs[NEAR], bounds); 
break; 

case ZEROMV: 
this->base.mv.raw = 0; 
return; //skip need mc border check 

case NEWMV: 
clamped best mv - clamp mv (near mvs[BEST], bounds); 
read mv(bool, g«this->base.mv, hdr->mv probs); 
this->base.mv.d.x += clamped best mv.d.x; 
this->base.mv.d.y += clamped best mv.d.y; 
break; 

case SPLITMV: 

{ 


clamp mv (near_mvs [NEAREST], bounds); 


union mv chroma mv[4] = (110))); 


clamped best mv = clamp_mv(near_mvs[BEST], bounds); 

decode split mv(this, left, above, hdr, &clamped best mv, 
bool); 

this->base.mv = this->split.mvs[15]; 
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for (b = 0; b < 16; b++) 


chroma mv[ (b>>141) + (b>>28£2)].d.x += 
this-»5split.mvs[b].d.x; 

chroma_mv[(b>>1&1) + (b>>2&2)].d.y += 
this->split.mvs[b].d.y; 


if (need mc border (this->split.mvs[b], 
x + (b & 3 * 4, y + (b s 73), 4, w, h)) 
{ 
this->base.need mc border = 1; 
break; 


for (b = 0; b < 4; b++) 


chroma mv[b].d.x += 4 + 8 % (chroma mv[b].d.x >> 31); 
chroma mv[b].d.y += 4 + 8 * (chroma mv[b].d.y >> 31); 
chroma mv[b].d.x /< 4; 
chroma mv[b].d.y /< 4; 


//note we' re passing in non-subsampled coordinates 


if (need mc border (chroma mv[b], 

x + (be 1) * 8 y + (b >> 1) * 8, 16, w, h)) 
this->base.need mc border - 1; 
break; 


} 


return; //skip need_mc_border check 
} 
default: 

assert (0); 


} 


if (need mc border(this-»base.mv, x, y, 16, w, h)) 
this->base.need mc border - 1; 
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void 
vp8 dixie modemv process row(struct vp8 decoder ctx *ctx, 
struct bool decoder *bool, 
int row, 
int start_col, 
int num_cols) 
{ 
struct mb_info *above, *this; 
unsigned int col; 


struct mv_clamp_rect bounds; 


this = ctx-»mb info rows[row] + start_col; 
above = ctx-»mb info rows[row - 1] + start col; 


/* Calculate the eighth-pel MV bounds using a 1 MB border. */ 


bounds.to left = —-((start col + 1) << 7); 

bounds.to right = (ctx-»mb cols - start col) << 7; 
bounds.to top = —-((row + 1) << 7); 

bounds.to bottom = (ctx-»mb rows - row) << 7; 

for (col = start col; col « start col + num cols; col++) 


{ 
if (ctx->segment hdr.update map) 
this->base.segment id = read segment id(bool, 
&ctx-»segment hdr); 


if (ctx->entropy hdr.coeff skip enabled) 
this->base.skip coeff - bool get (bool, 
ctx-»entropy hdr.coeff skip prob); 


if (ctx->frame hdr.is keyframe) 
if (!ctx->segment hdr.update map) 
this->base.segment id = 0; 


decode kf mb mode(this, this - 1, above, bool); 
} 
else 
{ 
if (bool_get (bool, ctx-»entropy hdr.prob inter)) 
decode mvs(ctx, this, this - 1, above, &bounds, 
bool); 
else 
decode intra mb mode(this, &ctx-»entropy hdr, bool); 


bounds.to left -< 16 << 3; 
bounds.to right -< 16 << 3; 
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/* Advance to next mb */ 
thistt; 
abovett; 


void 

vp8_dixie_modemv_init (struct vp8 decoder ctx *ctx) 
unsigned int mbi w, mbi h, i; 
struct mb info "mbi; 


mbi w < ctx->mb cols t 1; /* For left border col */ 
mbi h < ctx-»mb rows t 1; /* For above border row %/ 


if (ctx->frame hdr.frame size updated) 
{ 
free(ctx-»mb info storage); 
ctx->mb info storage = NULL; 
free(ctx-»mb info rows storage); 
ctx->mb info rows storage = NULL; 


} 


if (!ctx-»mb info storage) 
ctx-»mb info storage = calloc(mbi w * mbi h, 
Sizeof(*ctx-»mb info storage)); 


if (!ctx-»mb info rows storage) 


ctx-»mb info rows storage - calloc(mbi h, 
Sizeof(*ctx-»mb info rows storage)); 


/* Set up row pointers */ 
mbi < ctx-»mb info storage t 1; 


for (i = 0; i < mbi h; i++) 

{ 
ctx-»mb info rows storage[i] = mbi; 
mbi += mbi w; 


} 


ctx-»mb info rows = ctx-»mb info rows storage + 1; 
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20. 


void 
vp8 dixie modemv destroy(struct vp8 decoder ctx *ctx) 
{ 
free(ctx-»mb info storage); 
ctx->mb info storage = NULL; 
free(ctx-»mb info rows storage); 
ctx->mb info rows storage = NULL; 


} 
+5 End. code block zremo SE PE RS o S SSS sees 
12. modemv.h 


-cc— Begin code block === === 255-2225 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

A 


#ifndef MODEMV H 
#define MODEMV H 


void 


vp8 dixie modemv init(struct vp8 decoder ctx *ctx); 


void 
vp8 dixie modemv destroy(struct vp8 decoder ctx "ctx); 


void 
vp8 dixie modemv process row(struct vp8 decoder ctx *ctx, 
struct bool decoder *bool, 
int row, 
int start_col, 
int num cols); 
fendif 


Soda) ENA coge DV OK o 23225 ESO m ee 
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20.13.  modemv data.h 


ssm Begin code block -e-e-5o Eno soo 2 SH HS 32 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
x tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 


static const unsigned char kf y mode probs[] < ( 145, 156, 163, 128); 
static const unsigned char kf uv mode probs[] < ( 142, 114, 183); 
static const unsigned char kf b mode probs[10][10][9] < 

{ 


x 


above mode 0 */ 

/* left mode */ 231, 120, 48, 89, 115, 113, 120, 152, 112), 
/* left mode */ 152, 179, 64, 126, 170, 118, 46, 70, 95), 
/* left mode */ 175, 69, 143, 80, 85, 82, T2, 155, 103), 
/* left mode */ 56, 58, 10, 171, 218, 189, d s 13, 152), 
left mode x/ 144, 71, 10, 38, 171, 213, 144, 34, 26), 
/* left mode 114, 26, 17, 163, 44, 195, 21, 10, 173), 
/* left mode */ 121, 24, 80, 195, 26, 62, 44, 64, 85}, 
/* left mode */ 170, 46, 55, 19, 136, 160, 33, 206, 7313; 
/* left mode */ 63, 20, 8, 114, 114, 208, 12; 9, 226], 
/* left mode */ 81, 40, TI, 96, 182, 84, 29, 16, 36) 


{ 


-aaxaa nana ao ao KAN 
7 
* 
KO 0-100140 PFKNn| oO 
* 
“> 


/* above mode 1 */ 

( /* left mode 0 */ 134, 183, 89, 137, 98, 101, 106, 165, 148), 
( /* left mode 1 */ 72, 187, 100, 130, 157, 111, 32, 75, 80], 
( /* left mode 2 */ 66, 102, 167, 99, 74, 62, 40, 234, 128], 
( /* left mode 3 */ 41, 53, 9, 178, 241, 141, 26, 8, 107), 
{ /* left mode 4 */ 104, 79, 12, 27, 217, 255, 87, 17, 7, 
( /* left mode 5 */ 74, 43, 26, 146, 73, 166, 49, 23, 157], 
( /* left mode 6 

( /* left mode 7 

( /* left mode 8 

( /* left mode 9 


*h- 65, 38, 105, 160, .5l, 52, 31; 115, 128), 
*/ 87, 68, 71, 44, 114, 51, 15, 186, 23}, 
*/ AT, 41, 14, 110, 182, 183, 21, 17, 194), 
*/ 66, 45, 25, 102, 197, 189, 23, 18, 22} 


/* above mode 2 */ 

( /* left mode */ 88, 88, 147, 150, 42, 46, 45, 196, 205], 
( /* left mode */ 43, 97,183, IIT, 85, 38, 35, 179, 61}, 
( /* left mode 53, 200, 87, 26, 21, 43, 232, 171], 
( /* left mode x/ 56, 34, 51, 104, 114, 102, 29, 93, 771), 
( /* left mode */ 107, 54, 32, 26, 51, 1, 81, 43, SL, 


Ss COPKN HO 
+ 
%— 
Ww 
Ne} 
~ 
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( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

), 

( /* above mode 3 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

), 

( /* above mode 4 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

, 

( /* above mode 5 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

, 

( /* above mode 6 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
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39, 28, 85, 171, 
34, 22, 116, 206, 
68, 25, 106, 22, 
34, 1:93 * 2355, 25027 
62, 18, 78, 95, 
193, VOL, 35, 159; 
60, 148, 31, 172, 
112,. 113, -77,- 85, 
40, 42, 1, 196, 
100, 80, 8, 43, 
88, 43, 29, 140, 
61, 63, 30, 155, 
142, 78, 78, 16, 
41, 40, 5; 102; 
51, 50, 17, 168, 
125; 98, 42, 88, 
95, 84, 53, 89, 
TD, 9, 123, C 
57, L; By “TE; 
TX5j. "21, 2, 10, 
38, 33, 13; 121, 
41, 10, 67, 138, 
101, 29, 16, 10, 
SN 18, 10, 102, 
117, 20, l5,.-36, 
138, 31, 36, 171, 
67, 87, 58, 169, 
63; "59, 90, 180, 
40, 40, 21, 116, 
57, 46, 22, 24, 
47, 15, 16, 183, 
46, I6. 335; 183; 
6557: 32; , Tar ALES 
40, Š; 9; 115, 
8]; OU 95. ELS; 
104, 55, 44, 218, 
64, 90, 70, 205, 
54, 57; 112, 184, 
30;..34;. 267 133; 
75, 32, T2;. 5L 
Informational 


58, 
23, 
64, 
132; 
85, 


215, 
219, 
179, 
245, 
154, 
166, 

67, 
255, 
211, 
209, 


104, 
128, 
51, 
102, 
102, 
sl; 
TT, 
85; 
102, 
163, 


143, 
128, 


165, 
34, 
1:72; 
188, 
57, 


116, 
255, 


November 2011 


90, 
43, 
36, 
16, 
50, 


89, 
21, 
38, 
10, 
51, 


160, 


98, 
166, 
225, 

76, 

48, 


64), 

73}, 
114}, 
124}, 

51} 


111), 
111], 
114], 
109), 
71], 
154], 
209], 
T Eb, 
221], 
82] 


82], 
45], 
1), 
49], 
6}, 
85}, 
114}, 
26}, 
43}, 
26} 


229}, 
179}, 
154}, 
175}, 
37}, 
183}, 
183}, 
205}, 
223}, 
47} 


226}, 
57}, 
213}, 
134}, 
51}, 
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( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

), 

( /* above mode 7 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

), 

( /* above mode 8 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 

, 

( /* above mode 9 */ 
( /* left mode 0 */ 
( /* left mode 1 */ 
( /* left mode 2 */ 
( /* left mode 3 */ 
( /* left mode 4 */ 
( /* left mode 5 */ 
( /* left mode 6 */ 
( /* left mode 7 */ 
( /* left mode 8 */ 
( /* left mode 9 */ 


y; 
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39, 1%, 53, 221, 
31, 9, 65, 234, 
88, 31, 35, 67, 
56, .271, 23; 111, 
55, 38, 70; 124, 
102, 61, 71, 37, 
69, 60, 71, 38, 
68, 45, 128, 34, 
62, 17, 19, 70, 
UAM 15; 9: 95 
37; 43; 37; 154; 
63, 97 92, 136, 
86, 6, 28, Di 
56, 8, 17, 1327 
58, 15, 20, 82, 
164, 50, 31, 137, 
51, 103, 44, 131, 
86, 40, 64, 135, 
22, 26, 17,4 2031; 
83, 12, 13, 54, 
45, 16, 21, 91, 
506, Lis <397 1997 
85, 26, 85, 85, 
18, It; 7, 63, 
35, “27,5 10, 146, 
190, -80, 35; 99, 
85, 126, 47, 87, 
TOT, "75, 128, 139, 
56, 41, 15, 176, 
146, 36, 19:5. 309 
JA... 30, TZ I9; 
101, 38, 60, 138, 
138, 45, 61, 62, 
32,. M5. 20; TTT, 
112, 19, T2, 61, 
Informational 


26, 


114, 
15, 
85, 

205, 

102, 


November 2011 


128, 


188, 


4, 


255], 
73}, 
85}, 

192}, 
98} 


192), 
37), 
171), 
70}, 
16}, 
1}, 
85}, 
1}, 
128}, 
40} 


218}, 
158}, 
128}, 
209}, 
28}, 
197}, 
213}, 
LILA 
246), 
128) 


45), 
32), 
85), 
62), 
20), 
138), 
142), 
64), 
163), 
24) 
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static const int kf y mode tree[] - 


{ 


-B PRED, 2, 

4, 6, 

-DC PRED, -V PRED, 
-H PRED, -TM PRED 


y; 


static const int y mode tree[] = 


{ 


-DC PRED, 2, 


4, 6, 
-V PRED, -H PRED, 
-TM PRED, -B PRED 


y; 


static const int uv mode tree[6] = 


{ 


-DC PRED, 2, 


-V PRED, 4, 
-H PRED, 
y; 


static const int b mode tree[18] 


{ 
-B DC PRED, 
-B TM PRED, 
-B VE PRED, 
Sa 127 
-B HE PRED, 
-B RD PRED, 
-B LD PRED, 
-B VL PRED, 
-B HD PRED, 

y; 

static const 


( 


2, 8, 

4, 6, 

-0, -1, 
SN 
10, 12, 
-4, -5, 
7642-4 


y; 
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-TM PRED 


2, 
4, 
6, 


10, 
-B_VR_PRED, 
14, 

16, 
-B_HU_PRED 


int small_mv_tree[14] = 


/* 0 = DC_NODE */ 
/* 1 = TM NODE */ 
/* 2 < VE NODE */ 
/* 3 = COM NODE */ 
/* 4 < HE NODE */ 
/* 5 < RD NODE */ 
/* 6 = LD NODE */ 
/* 7 < VL NODE */ 
/* 8 < HD NODE */ 
Informational 
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static const int mv ref tree[8] < 
{ 
-ZEROMV, 2, 
-NEARESTMV, 4, 
-NEARMV, 6, 
-NEWMV, -SPLITMV 
y; 
static const int submv ref tree[6] = 
{ 
-LEFTAXA, 2, 
—-ABOVE4X4, 4, 
—-ZERO4X4, -NEWAXA 
y; 
static const int split mv tree[6] = 


y 
static const unsigned char default b mode probs[] = 
111207 90y. 79, 133, “87, :85, 80, lll, 151); 
static const unsigned char mv counts to probs[6][4] < 
{ 

{ 7, 1, 1, 143 ), 

( 14, 18, 14, 107 ), 

{ 135, 64, 57, 68 }, 

1. 60; 56, 128, .:65- b, 

( 159, 134, 128, 34 ), 

( 234, 188, 128, 28 ) 
h 
static const unsigned char split mv probs[3] < 
( 110, 111, 150); 


static const unsigned char submv ref probs2[5][3] = 
{ 

147, 136, 18 }, 

106, 145, 1}, 

179, 121, 1}, 

223, 1, 34 ), 

208, lg ^W. 


HA a c4 oc 


y; 
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20. 


const static int mv partitions[4][16] < 


(0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1 hy 
(0, 0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 1 hy 
(0, 0, 1, 1, 0, 0, 1, 1, 2, 2, 3, 3, 2, 2, 3, 3 , 
[0, 1, 2, 37 4, 5, Oy T, 8; 9, 10, ll, 12, 13, 14, 15 ) 


ecco und code block —----—---9-2—9--——————-6-09—-------ccooc 


predict.c 


===="Begán code.block.-------------—4o5-o---co-ueocemenoeocm- ee 


PAG: 
+ + + A AE F F F X 


eS 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


#include "dixie.h" 
finclude "predict.h" 
#include "idct add.h" 
#include "mem.h" 
#include <assert.h> 
finclude <string.h> 


enum 


{ 


y; 


BORDER_PIXELS = 16, 


static const filter t sixtap filters[8] = 


{ 


y; 


1 0, 0, 128, 0, 0, 0), 
{Op 56, 123; TO. apos dus 
[29 neki, L08, 36, -8, 1}, 
[Dy -9, 93, 50, -6, 0}, 
Cap eier $7, Jlr oier 3] 
(0, -6, 50, 93, -9, 09}, 
GIy nag > BG, 1085 i 235 
(05. le X24 1235 «6, wk) 
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static const filter t bilinear filters[8] < 


( 


(0, 0, 128, 0, 0, 0), 
[205 0, 112; 16, 0, 0 }, 
(0, 0, 96, 32, 0, 0), 
(0, 0, 80, 48, 0, 0), 
(0, 0, 64, 64, 0, 0 ), 
(0, 0, 48, 80, 0, 0), 
(0, 0, 32, 96, 0, 0), 
{o0 - 205 16, 112, 0, 0) 


y; 


static void 
predict_h_nxn(unsigned char *predict, 


int stride, 
int n) 
{ 
unsigned char *left = predict - 1; 
int i, j; 
for (i = 0; i < n; itt) 


for (j = 0; j < n; jtt 
predict[i *stride + j] = left[i * stride]; 


static void 
predict_v_nxn(unsigned char *predict, 


int stride, 
int n) 
{ 
unsigned char *above = predict - stride; 
int i, j; 
for (i = 0; i < n; itt) 
for (j = 0; j < n; J++) 
predict[i *stride + j] = above[jl; 
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static void 
predict tm nxn(unsigned char *predict, 


} 


int stride, 
int n) 


/* Transposes the left column to the top row for later 
* consumption by the idct/recon stage 


*/ 
unsigned char *left - predict - 1; 
unsigned char *above - predict - stride; 
unsigned char p = above[-1]; 

int i, ji 


for (i = 0; i < n; itt) 
predict[i] = CLAMP 255("left + aboveli] - p); 


predict += stride; 
left += stride; 


static void 
predict dc nxn(unsigned char *predict, 


int stride, 
int n) 


unsigned char *left - predict - 1; 


unsigned char *above - predict - stride; 
int i, j, de - 0; 
for (i = 0; i < n; itt) 


dc += "left + aboveli]; 
left += stride; 


2011 
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switch (n) 

{ 

case 16: 
dc = (dc + 16) >> 5; 
break; 

case 8: 
dc = (de + 8) >> 4; 
break; 

case 4: 
dc = (dc + 4) >> 3; 
break; 

} 

for (i = 0; i < n; itt) 
for (j = 0; j < n; j**) 


predict[i *stride + j] 


static void 


= dc; 


predict_ve_4x4 (unsigned char *predict, 


{ 


L 


nt 


unsigned char *above = 


int 


i, ji 


stride) 


predict[0] = (above[-1] + 2 * above[0] 
predict[1] = (above[ 0] + 2 * above[1] 
predict[2] = (above[ 1] + 2 * above[2] 
predict[3] = (above[ 2] + 2 * above[3] 
for (i = 1; i < 4; i++) 
for (j = 0; j < 4; j++) 
predict[i *stride + j] = 
} 
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predict - stride; 


++++ 


and Decoding Guide 


above[1] 
above [2] 
above[3] 
above[4] 


predict[j]; 


++++ 
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2) >> 2; 
2) >> 2; 
2) >> 2; 
2) >> 2; 
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static void 
predict he 4x4 (unsigned char *predict, 


{ 


i 
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nt stride) 


unsigned char *left = predict - 1; 


predict [0] 
predict [1] 
predict [2] 
predict [3] 


(left [-stride] + 2 * left[0] 
left[stride] + 2) >> 2; 


predict += stride; 
left += stride; 


predict [0] 
predict [1] 
predict [2] 
predict [3] 


(left [-stride] + 2 * left[0] 
left[stride] + 2) >> 2; 


predict += stride; 
left += stride; 


predict [0] 
predict [1] 
predict [2] 
predict [3] 


(left [-stride] + 2 * left[0] 
left[stride] + 2) >> 2; 


predict += stride; 
left += stride; 


predict [0] 
predict [1] 
predict [2] 
predict [3] 


Bankoski, 


et al. 


(left [-stride] + 2 * left[0] 
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+ left[0] 
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+ 2) >> 2; 
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static void 
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predict ld 4x4 (unsigned char *predict, 


int 
{ 


unsigned char *above 


int predo, 
predict[0] = pred0 = 
predict[1] = predl = 
predict[2] = pred2 = 
predict[3] = pred3 = 
predict += stride; 
predict[0] = predl; 
predict[1] = pred2; 
predict[2] = pred3; 
predict[3] = pred4 = 
predict += stride; 
predict[0] = pred2; 
predict[1] = pred3; 
predict[2] = pred4; 
predict[3] = pred5 = 
predict += stride; 
predict[0] = pred3; 
predict[1] = pred4; 
predict[2] = pred5; 
predict[3] = pred6 = 
} 
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stride) 
= predict - stride; 
predl, pred2, pred3, pred4, 
(above[0] + 2 * above[1] + 
above[2] + 2) >> 2; 
(above[1] + 2 % above[2] + 
above[3] + 2) >> 2; 
(above[2] + 2 * above[3] + 
above[4] + 2) >> 2; 
(above[3] + 2 * above[4] + 
above[5] + 2) >> 2; 
(above[4] + 2 * above[5] + 
above[6] + 2) >> 2; 
(above[5] + 2 * above[6] + 
above[7] + 2) >> 2; 
(above[6] + 2 * above[7] + 
above[7] + 2) >> 2; 
Informational 


pred5, 
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pred6; 
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static void 
predict rd 4x4 (unsigned char *predict, 


int stride) 
{ 

unsigned char *left = predict - 1; 
unsigned char *above = predict - stride; 
int pred0, predl, pred2, pred3, pred4, pred5, pred6; 
predict[0] = predo = 

(left[ 0] + 2 * above[-1] + above[0] + 2) >> 2; 
predict[1] < predl - 

(above[-1] + 2 * above[ 0] + above[1] + 2) >> 2; 
predict[2] = pred2 = 

(above[ 0] + 2 * above[ 1] + above[2] + 2) >> 2; 
predict[3] = pred3 = 


(above[ 1] + 2 * above[ 2] + above[3] + 2) >> 2; 
predict += stride; 


predict[0] = pred4 = 
(left [stride] + 2 * left[0] + above[-1] + 2) >> 2; 
predict[1] = pred0; 
predict[2] = predl; 
predict[3] = pred2; 
predict += stride; 
predict[0] = pred5 = 
(left [stride*2] + 2 * left[stride] + left[0] + 2) >> 2; 
predict[1] = pred4; 
predict [2] = pred0; 
predict[3] = predl; 
predict += stride; 
predict[0] = pred6 = (left[stride"3] + 2 * left[stride"2] + 
left[stride] + 2) >> 2; 
predict[1] = pred5; 
predict[2] = pred4; 
predict [3] = pred0; 
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static void 
predict vr 4x4 (unsigned char *predict, 


int stride) 
{ 

unsigned char *left = predict - 1; 

unsigned char *above = predict - stride; 

int pred0, predl, pred2, pred3, pred4, pred5, pred6, 

pred7, pred8, pred9; 

predict[0] = pred0 = (above[-1] + above[0] + 1) >> 1; 

predict[1] = predl = (above[ 0] + above[1] + 1) >> 1; 

predict[2] = pred2 = (above[ 1] + above[2] + 1) >> 1; 

predict[3] = pred3 = (above[ 2] + above[3] + 1) >> 1; 

predict += stride; 

predict[0] = pred4 = (left[ 0] + 2 * above[-1] + 
above[0] + 2) >> 2; 

predict[1] = pred5 = (above[-1] + 2 * above[ 0] + 
above[1] + 2) >> 2; 

predict[2] = pred6 = (above[ 0] + 2 * above[ 1] + 
above[2] + 2) >> 2; 

predict[3] = pred7 = (above[ 1] + 2 * above[ 2] + 
above[3] + 2) >> 2; 

predict += stride; 

predict[0] = pred8 = 

(left [stride] + 2 * left[0] + above[-1] + 2) >> 2; 

predict[1] = pred0; 

predict [2] = predl; 

predict [3] = pred2; 

predict += stride; 

predict[0] = pred9 = 

(left [stride*2] + 2 * left[stride] + left[0] + 2) >> 2; 

predict[1] = pred4; 

predict[2] = pred5; 

predict[3] = pred6; 
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static void 
predict vl 4x4 (unsigned char *predict, 


int stride) 
{ 
unsigned char *above = predict - stride; 
int pred0, predl, pred2, pred3, pred4, pred5, pred6, 
pred7, pred8, pred9; 

predict[0] = pred0 = (above[0] + above[1] + 1) >> 1; 

predict[1] = predl = (above[1] + above[2] + 1) >> 1; 

predict[2] = pred2 = (above[2] + above[3] + 1) >> 1; 

predict[3] = pred3 = (above[3] + above[4] + 1) >> 1; 

predict += stride; 

predict[0] = pred4 = (above[0] + 2 * above[1] 
above[2] + 2) >> 2; 

predict[1] = pred5 = (above[1] + 2 * above[2] 
above[3] + 2) >> 2; 

predict[2] = pred6 = (above[2] + 2 * above[3] 
above[4] + 2) >> 2; 

predict[3] = pred7 = (above[3] + 2 * above[4] 
above[5] + 2) >> 2; 


predict += stride; 


predict[0] = predl; 
predict[1] = pred2; 
predict[2] = pred3; 
predict[3] = pred8 = (above[4] + 2 * above[5] + 


above[6] + 2) >> 2; 


predict += stride; 


predict[0] = pred5; 
predict[1] = pred6; 
predict[2] = pred7; 
predict[3] = pred9 = (above[5] + 2 * above[6] + 
above[7] + 2) >> 2; 
} 
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static void 
predict hd 4x4 (unsigned char *predict, 


{ 


Bankoski, 


unsigned char *left 


unsigned char *above 


int 

predict [0] 
predict [1] 
predict [2] 
predict [3] 
predict += 


predict [0] 
predict [1] 


predict [2] 
predict [3] 
predict += 


predict [0] 
predict [1] 


predict [2] 
predict [3] 
predict += 


predict [0] 
predict [1] 


predict [2] 
predict [3] 
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int stride) 
predict - 1; 
= predict - stride; 
pred0, predl, pred2, pred3, pred4, pred5, pred6, 
pred7, pred8, pred9; 
= pred0 = (left[ 0] + above[-1] + 1) >> 1; 
= predl = (left[ 0] + 2 * above[-1] + 
above[0] + 2) >> 2; 
= pred2 = (above[-1] + 2 * above[ 0] + 
above[1] + 2) >> 2; 
= pred3 = (above[ 0] + 2 * above[ 1] + 
above[2] + 2) >> 2; 
stride; 
= pred4 = (left[stride] + left[0] + 1) >> 1; 
= pred5 = (left[stride] + 2 * left[0] + 
above[-1] + 2) >> 2; 
= pred0; 
= predl; 
stride; 
= pred6 = (left[stride*2] + left[stride] + 1) >> 1; 
= pred7 = (left[stride*2] + 2 * left[stride] + 
left [0] + 2) >> 2; 
= pred4; 
= pred5; 
stride; 
= pred8 = (left[stride*3] + left[stride*2] + 1) >> 1; 
= pred9 = (left[stride*3] + 2 * left[stride*2] + 
left [stride] + 2) >> 2; 
= pred6; 
= pred7; 
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static void 

predict hu 4x4 (unsigned 
int 

{ 


unsigned char *left 


int predo 
predict[0] = pred0 = 
predict[1] < predl - 
predict[2] = pred2 = 
predict[3] = pred3 = 
predict += stride; 
predict[0] = pred2; 
predict[1] = pred3; 
predict[2] = pred4 = 
predict[3] = pred5 = 
predict += stride; 
predict[0] = pred4; 
predict[1] = pred5; 
predict[2] = pred6 = 
predict[3] = pred6; 
predict += stride; 
predict[0] = pred6; 
predict[1] = pred6; 
predict [2] = pred6; 
predict[3] = pred6; 


static void 
predict_h_16x16 (unsigned 
{ 


predict_h_nxn(predict, 


} 


static void 
predict_v_16x16 (unsigned 
{ 


predict_v_nxn(predict, 


} 
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char *predict, 
stride) 


predict - 1; 


, predl, pred2, pred3, pred4, pred5, pred6; 
(left [stride*0] + 
left [stride*1] + 1) >> 1; 
(left [stride*0] + 2 * left[stride*1] + 
left[stride*2] + 2) >> 2; 
(left [stride*1] + left[stride*2] + 1) >> 1; 
(left [stride*1] + 2 * left[stride*2] + 
left [stride*3] + 2) >> 2; 
(left [stride*2] + left[stride*3] + 1) >> 1; 
(left [stride*2] + 2 * left[stride*3] + 
left[stride*3] + 2) >> 2; 


left[stride*3]; 


char *predict, int stride) 


stride, 16); 


char *predict, int stride) 


stride, 16); 
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static void 
predict tm 16xl6(unsigned char *predict, int stride) 
( 

predict tm nxn(predict, stride, 160); 


} 


static void 
predict_h_8x8 (unsigned char *predict, int stride) 
{ 

predict_h_nxn(predict, stride, 8); 


} 


static void 
predict_v_8x8 (unsigned char *predict, int stride) 
{ 

predict_v_nxn(predict, stride, 8); 


} 


static void 
predict_tm_8x8 (unsigned char *predict, int stride) 
{ 

predict_tm_nxn(predict, stride, 8); 


} 


static void 
predict tm 4x4(unsigned char "predict, int stride) 
{ 

predict_tm_nxn(predict, stride, 4); 


} 


static void 
copy down (unsigned char *recon, 
int stride) 
{ 
/* Copy the four pixels above-right of subblock 3 to 
* above-right of subblocks 7, 11, and 15 
ard 
uint32 t tmp, *copy = (void *) (recon + 16 - stride); 
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stride - stride / sizeof(unsigned int); 

tmp — *copy; 

copy += stride * 4; 

*copy = tmp; 

copy += stride * 4; 

*copy = tmp; 

copy += stride * 4; 

*copy = tmp; 


static void 

b_pred(unsigned char *predict, 
int stride, 
struct mb_info *mbi, 
short *coeffs) 


int ig 


copy down(predict, stride); 


for i < 16; i++) 


{ 


unsigned char *b_predict = predict + 


switch 


{ 


case 


(mbi->split.modes[i]) 


B_DC_PRED: 
predict dc nxn(b predict, 
break; 
B TM PRED: 
predict tm 4x4 (b predict, 
break; 
B VE PRED: 
predict ve 4x4 (b predict, 
break; 
B HE PRED: 
predict he 4x4 (b predict, 
break; 
case B LD PRED: 
predict ld 4x4 (b predict, 
break; 
B RD PRED: 
predict rd 4x4 (b predict, 
break; 
B VR PRED: 
predict vr 4x4(b predict, 
break; 


case 


case 


case 


case 


case 
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stride, 


(i 8 3) 


4); 


stride); 


stride); 


stride); 


stride); 


stride); 


stride); 


* 
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case B VL PRED: 
predict vl 4x4(b predict, stride); 
break; 

case B HD PRED: 
predict hd 4x4(b predict, stride); 
break; 

case B HU PRED: 
predict hu 4x4(b predict, stride); 
break; 

default: 
assert (0); 


} 


vp8 dixie idct add(b predict, b predict, stride, 


coeffs += 16; 


if ((i & 3) == 3) 
{ 

predict += stride * 4; 
} 


static void 


fixup_dc_coeffs(struct mb_info *mbi, 
short *coeffs) 
{ 
short y2[16]; 
int dj 


vp8 dixie walsh(coeffs t 24 * 16, y2); 


for (i = 0; i < 16; i++) 
coeffs[i*16] = y2li]; 


static void 


predict_intra_luma(unsigned char *predict, 
int stride, 
struct mb info  "mbi, 
short *coeffs) 
{ 
if (mbi->base.y_mode == B_PRED) 
b_pred(predict, stride, mbi, coeffs); 
else 
Bankoski, et al. Informational 
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coeffs); 
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int i; 


Switch (mbi->base.y mode) 

case DC PRED: 
predict dc nxn(predict, stride, 16); 
break; 

case V PRED: 
predict v 16x16(predict, stride); 
break; 

case H PRED: 
predict h 16x16(predict, stride); 
break; 

case TM PRED: 
predict tm 16x16(predict, stride); 
break; 

default: 
assert(0); 


} 
fixup_dc_coeffs(mbi, coeffs); 


for (i = 0; i < 16; i++) 

{ 
vp8 dixie idct add(predict, predict, stride, coeffs); 
coeffs += 16; 
predict += 4; 


if ((1 € 3) == 3) 
predict += stride * 4 - 16; 


static void 


predict intra chroma (unsigned char *predict u, 
unsigned char *predict v, 
int stride, 
struct mb info  "mbi, 
short *coeffs) 

{ 

int. i; 
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switch (mbi->base.uv mode) 

case DC PRED: 
predict dc nxn(predict u, 
predict dc nxn(predict v, 
break; 

case V PRED: 
predict v 8x8 (predict u, 
predict v 8x8 (predict v, 
break; 

case H PRED: 
predict h 8x8 (predict u, 
predict h 8x8 (predict v, 
break; 

case TM PRED: 
predict tm 8x8 (predict u, 
predict tm 8x8 (predict v, 
break; 

default: 
assert (0); 


} 


coeffs += 16 * 16; 
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8); 
8); 


stride, 
stride, 


stride); 
stride); 


stride); 
stride); 


stride); 
stride); 


for (i = 16; i < 20; i++) 
{ 
vp8 dixie idct add(predict u, predict u, stride, coeffs); 
coeffs += 16; 
predict u += 4; 
if (i & 1) 
predict u += stride * 4 - 8; 
} 
for (i = 20; i < 24; i++) 
{ 
vp8 dixie idct add(predict v, predict v, stride, coeffs); 


coeffs += 16; 
predict v += 4; 


if (i & 1) 


predict_v += stride * 4 - 8; 


Bankoski, et al. 


Informational 


[Page 213] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


static void 


sixtap horiz (unsigned char *output, 
int output stride, 
const unsigned char *reference, 
int reference stride, 
int cols, 
int rows, 
const filter_t filter 


int r, c, temp; 


for (r = 0; r < rows; rtt) 
{ 
for (c = 0; c < cols; c++) 
{ 
temp = (reference[-2] * filter[0]) + 
(reference[-1] * filter[1]) + 
(reference[ 0] * filter[2]) + 
(reference[ 1] * filter[3]) + 
(reference[ 2] * filter[4]) + 
(reference[ 3] * filter[5]) + 
64; 
temp >>= 7; 
output[c] = CLAMP 255 (temp); 
reference++; 
} 
reference += reference stride - cols; 
output += output stride; 
} 
} 
static void 
sixtap_vert (unsigned char *output, 
int output_stride, 
const unsigned char *reference, 
int reference_stride, 
int cols, 
int rows, 
const filter_t filter 


int r, c, temp; 


for (r = 0; r < rows; rtt) 


{ 
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for (c = 0; c < cols; ctt) 
{ 
temp = (reference[-2*reference stride] 


reference [-1*reference_stride] 
reference[ O*reference stride] 
reference[ l"reference stride] 
reference[ 2*reference stride] 
reference[ 3*reference stride] 


————— 


64; 
temp >>= 7; 
output[c] < CLAMP 255(temp); 
reference++; 
) 
reference += reference stride - cols; 


output += output_stride; 


static void 


sixtap_2d(unsigned char *output, 
int output_stride, 
const unsigned char *reference, 
int reference_stride, 
int cols, 
int rows, 
int mx, 
int my, 
const filter t filters[8] 


) 


+ + AR Xo Xo X 


November 2011 


filter[0] 
filter[1] 
filter[2] 
filter[3] 
filter[4] 
filter[5] 


++++++ 


) 
) 
) 
) 
) 
) 


DECLARE_ALIGNED (16, unsigned char, temp[16*(16+5)]); 


sixtap_horiz (temp, 16, 
reference - 2 * reference stride, 
cols, rows + 5, filters[mx]); 
Sixtap vert(output, output stride, 
temp t 2 * 16, 16, 
cols, rows, filters[my]); 


struct img index 


{ 
unsigned char *y, *u, *v; 
int stride, uv_stride; 


y; 
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static const unsigned char * 


filter block(unsigned char *output, 
const unsigned char  *reference, 
int stride, 
const union mv *mv, 
const filter t filters[8]) 


int mx, my; 


/* Handle 0,0 as a special case. TODO: Does this make it any 
* faster? 
87 
if (!mv->raw) 
return reference; 


mx = mv->d.x & 7; 

my = mv->d.y & 7; 

reference += ((mv->d.y >> 3) * stride) + (mv->d.x >> 3); 
if (mx | my) 


{ 
sixtap_2d(output, stride, reference, stride, 4, 4, mx, my, 
filters); 
reference = output; 


} 


return reference; 


static void 


recon 1 block(unsigned char *output, 
const unsigned char  *reference, 
int stride, 
const union mv *mv, 
const filter t filters[8], 
short *coeffs, 
struct mb info *mbi, 
int b 


) 


const unsigned char *predict; 


predict = filter block(output, reference, stride, mv, filters); 
vp8 dixie idct add(output, predict, stride, coeffs + 16 * b); 
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calculate chroma splitmv(struct mb info *mbi, 


int temp; 
union mv mv; 


int b, 
int full pixel) 


temp = mbi->split.mvs[b].d.x + 
mbi->split.mvs[btl].d.x + 
mbi-»split.mvs[b*4].d.x + 
mbi->split.mvs[bt5].d.x; 

if (temp < 0) 

temp -= 4; 
else 
temp t< 4; 

mv.d.x = temp / 8; 

temp = mbi->split.mvs[b].d.y + 
mbi->split.mvs[b+1].d.y + 
mbi->split.mvs[b+4].d.y + 
mbi->split.mvs[bt5].d.y; 

if (temp < 0) 

temp -= 4; 
else 
temp += 4; 

mv.d.y = temp / 8; 

if (full pixel) 

{ 

mv.d.x &= ^7; 
mv.d.y &= ^7; 
} 
return mv; 
} 
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We rely on the reconstructed border having the same stride 


* as the reference buffer because the filter block can't adjust the 


* stride with its return value, 
kJ 

static void 

build mc border (unsigned char 


only the reference pointer. 


*dst, 


const unsigned char *src, 


int 
int 
int 
int 
int 
int 
int 


) 


const unsigned char *ref_row; 


/* Get a pointer to the start of the 


stride, 


real data for this row */ 


ref row = src - x - y * stride; 
if (y >< h) 
ref row += (h - 1) * stride; 
else if (y > 0) 
ref row += y * stride; 
do 
{ 
int left, right = 0, copy; 
left = x < 0 ? -x : 0; 
if (left > b w) 
left = bw; 
if (x t b w > w) 
right < x t b w - w; 
if (right > b w) 
right < b w; 
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copy = b w - left - right; 
if (left) 
memset (dst, ref row[0], left); 
if (copy) 
memcpy (dst + left, ref row + x + left, copy); 
if (right) 
memset (dst + left + copy, ref row[w-1], right); 


dst += stride; 


ytt; 
if (y « h && y > 0) 
ref row += stride; 
} 
while (--b h); 


static void 


recon 1 edge block(unsigned char 
unsigned char 
const unsigned char 
int 
const union mv 
const filter t 
short 
struct mb info 
int 
int 
int 
int 
int 
) 
{ 

const unsigned char *predict; 

int b = start b; 

const int b w < 4; 

const int b h < 4; 

x += mv->d.x >> 3; 

y += mv->d.y >> 3; 
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/* Need two pixels left/above, 3 right/below for 6-tap */ 


if (x«2]||l x+bw-1+3>=w || y<2 || 
y + bh - 1 + 3 >=h) 
{ 


reference += (mv->d.x >> 3) + (mv->d.y >> 3) 


build mc border (emul block, 


reference - 2- 2 * stride, 


* stride; 


stride, 


x - 2, y - 2, b w t 5, b h t 5, w, h); 


reference - emul block t 2 * stride t 2; 


reference -- (mv->d.x >> 3) + (mv->d.y >> 3) 


} 


* stride; 


predict = filter_block(output, reference, stride, mv, filters); 
vp8 dixie idct add(output, predict, stride, coeffs + 16 * b); 


static void 
predict inter emulated edge(struct vp8 decoder ctx 
struct img index 
short 
struct mb info 
int 
int 


AGE, 
*img, 
*coeffs, 
*mbi, 
mb col, 
mb row) 


/* TODO: Move this into its own buffer. This only works because 


* we still have a border allocated. 


S% 


unsigned char *emul_block = ctx->frame_strg[0].img.img_data; 


unsigned char *reference; 
unsigned char *output; 


ptrdiff t reference offset; 

int w, h, x, y, b; 

union mv chroma mv[4]; 

unsigned char *u = img->u, "v = img->v; 

int full pixel = ctx->frame hdr.version == 3; 
x — mb col * 16; 

y = mb row * 16; 

w — ctx-»mb cols * 16; 

h < ctx-»mb rows * 16; 


output = img->y; 


reference offset - ctx->ref frame offsets[mbi->base.ref frame]; 
reference = output + reference offset; 
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(mbi-»base.y mode 


union mv uvmv; 


K Xx 


uvmv.d.x &- 


mbi-»base. 
= (uvmv. 
= (uvmv. 


(full pixel) 


Format and Decoding Guide 


!< SPLITMV) 


(uvmv.d.x >> 31) 
(uvmv.d.y >> 31) 


"s 


uvmv.d.y &= ^7; 


} 


chroma_mv [0] 
chroma_mv[1] 
chroma_mv [2] 
chroma_mv [3] 


else 


chroma_mv [0] 
chroma_mv[1] 
chroma_mv [2] 
chroma_mv [3] 


/* Luma */ 


FOT 


{ 


Bankoski, 


(b = 0; 


b < 16; 


union mv *ymv; 


if 
ymv = 
else 
ymv = 


recon_l_edge_block (output, 
img->stride, 
Y, 


mbi, x, 


x += 4; 
output += 4; 


reference += 4; 


et al. 


(mbi->base.y mode 
&mbi-»base.mv; 


W, 


uvmv; 
uvmv; 
uvmv; 
uvmv; 


calculate_chroma_splitmv (mbi, 
calculate_chroma_splitmv (mbi, 
calculate_chroma_splitmv (mbi, 
calculate_chroma_splitmv (mbi, 


b++) 


SPLITMV) 


mbi->split.mvs + b; 


emul_block, 
ymv, ctx->subpixel_filters, 
h, b); 


Informational 


* 


2) 
2) 


November 2011 


, 


full pixel); 
full pixel); 
) 
) 


, 


full pixel 
full pixel 


, 


reference, 
coeffs, 
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if ((b & 3) << 3) 
{ 

x -= 16; 

he og 


output += 4 * img->stride - 16; 
reference += 4 * img->stride - 16; 


x — mb col * 16; 

y = mb row * 16; 

/* Chroma */ 

x >>= 1; 

y >>= 1; 

w >>< 1; 

h >>= 1; 

for (b = 0; b < 4; b++) 


recon_l_edge_block (u, emul_block, 
img->uv_stride, 
&chroma mv[b], 
coeffs, mbi, x, 
recon 1 edge block(v, emul block, 
img->uv stride, 
&chroma mv[b], 


Y, W, h, b T 


u + reference_ 
ctx->subpixel_ 


v + reference 


November 2011 


offset, 


filters, 
16); 
offset, 


u += 4; 

v += 4; 

x += 4; 

if (b & 1) 

{ 
x —— 8; 
y += 4; 
u += 4 * 
v += 4 % 


} 


static void 

predict_inter (struct 
struct 
short 
struct 


Bankoski, et al. 


ctx->subpixel filters, 


coeffs, mbi, x, y, w, h, b t 20); 
img->uv stride 8; 
img->uv stride 8; 
vp8 decoder ctx  *ctx, 
img index *img, 

*coeffs, 

mb info *mbi) 
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unsigned char *y - img-»y; 
unsigned char *u - img->u; 
unsigned char "v < img->v; 
ptrdiff t reference offset; 
union mv chroma mv[4]; 
int full pixel = ctx->frame hdr.version == 3; 
int b; 
if (mbi->base.y mode != SPLITMV) 
{ 
union mv uvmv; 
uvmv < mbi->base.mv; 
uvmv.d.x = (uvmv.d.x + 1 + (uvmv.d.x >> 31) * 2) / 2; 
uvmv.d.y = (uvmv.d.y + 1 + (uvmv.d.y >> 31) * 2) / 2; 
if (full pixel) 
{ 
uvmv.d.x &= ^7; 
uvmv.d.y &= ^7; 
} 
chroma_mv[0] = 
chroma mv[1] = 
chroma mv[2] < 
chroma mv[3] < uvmv; 
} 
else 
{ 
chroma mv[0] = calculate chroma splitmv (mbi, 0, full pixel); 
chroma mv[1] = calculate chroma splitmv(mbi, 2, full pixel); 
chroma mv[2] < calculate chroma splitmv (mbi, 8, full pixel); 
chroma mv[3] < calculate chroma splitmv(mbi, 10, full pixel); 
} 
reference offset = ctx->ref frame offsets[mbi->base.ref frame]; 
for (b = 0; b < 16; bt) 
{ 
union mv *ymv; 
if (mbi->base.y_mode != SPLITMV) 
ymv = &mbi->base.mv; 


else 
ymv = mbi->split.mvs + b; 
recon 1 block(y, y + reference offset, img->stride, 
ymv, ctx->subpixel filters, coeffs, mbi, b); 
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for (b = 0; b < 4; b++) 


recon 1 block(u, u t reference offset, 

img->uv stride, &chroma mv[b], 

ctx->subpixel filters, coeffs, mbi, b t 16); 
recon 1 block(v, v t reference offset, 

img->uv stride, &chroma mv[b], 

ctx->subpixel filters, coeffs, mbi, b t 20); 


u += 4; 

v += 4; 

if (b & 1) 

{ 
u += 4 * img-»uv stride - 8; 
v += 4 * img-»uv stride - 8; 


void 
vp8 dixie release ref frame(struct ref cnt img *rcimg) 
{ 
if (rcimg) 
{ 
assert (rcimg->ref cnt); 
rcimg->ref cnt--; 


struct ref cnt img * 
vp8 dixie ref frame(struct ref cnt img *rcimg) 
{ 

rcimg->ref cnttt; 

return rcimg; 
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truct ref_cnt_img *frames) 


Tnt i; 
for (i = 0; i < NUM REF FRAMES; i++) 
if (frames[i].ref cnt == 0) 
{ 
frames[i].ref_cnt = 1; 
return sframesli]; 
} 
assert (0); 
return NULL; 
} 
static void 
fixup_left (unsigned char *predict, 
int width, 
int stride, 
unsigned int row, 
enum prediction_mode mode) 


/* 


The left column of out-o 
above row, unless this i 
* 129. 

XJ 
unsigned char *left 
int 1; 


predi 


if 
{ 


(mode 


DC_PRED && row) 


unsigned char *above 


for i < width; 
= above[il; 
stride; 


} 


else 


{ 


/* Need to re-set the above row, 


* DC PRED. 
+ 
left -= stride; 
Bankoski, et al. 


unless we’re doing DC_PRED, 


Informational 


f-frame pixels is taken to be 129, 
in which case we duplicate the 
s also row 0, in which case we use 


GE cns 


predict - stride; 


i++) 


in case the above MB was 


[Page 225] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


for (i = -1; i < width; i++) 


*left = 129; 
left += stride; 


static void 


fixup_above (unsigned char *predict, 
int width, 
int stride, 
unsigned int col, 


enum prediction_mode mode) 


/* The above row of out-of-frame pixels is taken to be 127, 
* unless we’re doing DC_PRED, in which case we duplicate the 
* left col, unless this is also col 0, in which case we use 


* 127. 
E 
unsigned char *above = predict - stride; 
int i; 
if (mode -- DC PRED && col) 
{ 
unsigned char *left = predict - 1; 


for (i = 0; i < width; i++) 
{ 

above[i] = *left; 

left += stride; 


} 


else 
/* Need to re-set the left col, in case the last MB was 
* DC_PRED. 
%/ 
memset (above - 1, 127, width + 1); 


memset (above + width, 127, 4); // for above-right subblock modes 
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void 
vp8 dixie predict init(struct vp8 decoder ctx *ctx) 


{ 


int 1i; 
unsigned char *this_frame_base; 


if (ctx->frame hdr.frame size updated) 
for (i = 0; i < NUM REF FRAMES; i++) 
{ 
unsigned int w = ctx-»mb cols * 16 + BORDER PIXELS * 2; 
unsigned int h ctx-»mb rows * 16 + BORDER PIXELS * 2; 


vpx img free(&ctx-»frame strg[il.img); 
ctx->frame strgl[li].ref cnt = 0; 
ctx->ref frames[i] < NULL; 


if (!vpx img alloc(&ctx-»frame strgli].img, 
IMG FMT I420, w, h, 16)) 
vpx internal error(&ctx-»error, VPX CODEC MEM ERROR, 
"Failed to allocate %dx%d" 
" framebuffer", 
w, h); 


vpx img set rect(&ctx-»frame strg[i].img, BORDER PIXELS, 
BORDER PIXELS, ctx->frame hdr.kf.w, 
ctx->frame hdr.kf.h); 


} 


if (ctx->frame hdr.version) 

ctx->subpixel filters = bilinear filters; 
else 

ctx->subpixel filters - sixtap filters; 


} 


/* Find a free framebuffer to predict into */ 
if (ctx->ref frames[CURRENT FRAME]) 
vp8 dixie release ref frame(ctx->ref frames[CURRENT FRAME]); 


ctx->ref frames[CURRENT FRAME] - 
vp8 dixie find free ref frame(ctx->frame strg); 
this frame base = ctx->ref frames[CURRENT FRAME]->img.img data; 
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/* Calculate offsets to the other reference frames */ 
for (i = 0; i < NUM REF FRAMES; i++) 
{ 


struct ref cnt img "ref = ctx->ref frames[il; 


ctx->ref frame offsets[i] < 
ref ? ref->img.img data - this frame base : 0; 


} 


/* TODO: No need to do this on every frame... */ 


void 
vp8 dixie predict destroy(struct vp8 decoder ctx *ctx) 


{ 


int i; 


for (i = 0; i < NUM REF FRAMES; i++) 
{ 


vpx img free(&ctx-»frame strg[il.img); 


ctx->frame strg[i].ref cnt < 0; 
ctx->ref frames[i] < NULL; 
} 
} 
void 
vp8 dixie predict process row(struct vp8 decoder ctx *ctx, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols) 


struct img index img; 
struct mb info *mbi; 
unsigned int col; 
short *coeffs; 
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/* Adjust pointers based on row, start col */ 


img. 


img 


img. 
img. 
img. 
img. 
img. 
img. 


mbi 
coe 


stride - 

ctx->ref frames[CURRENT FRAME]->img.stride[PLANE Y]; 
.uv stride - 

ctx->ref frames[CURRENT FRAME]->img.stride[PLANE U]; 


y = ctx-»ref frames[CURRENT FRAME]-»img.planes[PLANE Y]; 
u = ctx-»ref frames[CURRENT FRAME]->img.planes[PLANE U]; 
v — ctx-»ref frames[CURRENT FRAME]->img.planes[PLANE V]; 
y += (img.stride * row + start col) * 16; 

u += (img.uv stride * row + start col) * 8; 

v += (img.uv stride * row + start col) * 8; 

= ctx-»mb info rows[row] + start col; 

ffs = ctx->tokens[row & 


(ctx->token hdr.partitions - 1)].coeffs t 
25 * 16 % start col; 


/* Fix up the out-of-frame pixels */ 


if 
{ 


for 


Bankoski, 


(start_col == 0) 


fixup left(img.y, 16, img.stride, row, mbi->base.y mode); 
fixup left(img.u, 8, img.uv stride, row, mbi->base.uv mode); 
fixup left(img.v, 8, img.uv stride, row, mbi->base.uv mode); 


if (row << 0) 

*(img.y - img.stride - 1) < 127; 
(col = start col; col < start col + num cols; col++) 
if (row << 0) 


{ 
fixup_above(img.y, 16, img.stride, col, 
mbi->base.y_mode) ; 
fixup_above(img.u, 8, img.uv_stride, col, 
mbi-»base.uv mode); 
fixup above(img.v, 8, img.uv stride, col, 
mbi-»base.uv mode); 


} 


if (mbi->base.y_mode <= B_PRED) 
{ 
predict_intra_luma(img.y, img.stride, mbi, coeffs); 
predict_intra_chroma(img.u, img.v, img.uv_stride, mbi, 
coeffs); 
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else 
{ 
if (mbi->base.y mode != SPLITMV) // && != BPRED 
fixup_dc_coeffs(mbi, coeffs); 
if (mbi->base.need_mc_border) 
predict_inter_emulated_edge(ctx, &img, coeffs, mbi, 
col, row); 
else 
predict_inter(ctx, &img, coeffs, mbi); 
} 
/* Advance to the next macroblock */ 
mbi++; 
img.y += 16; 
img.u += 8; 
img.v += 8; 
coeffs += 25 * 16; 
} 
if (col == ctx->mb_cols) 


/* Extend the last row by four pixels for intra-prediction. 
* This will be propagated later by copy_down. 

*4 
uint32 t *extend = (uint32 t *)(img.y + 15 * img.stride); 
uint32 t val = 0x01010101 * img.y[-1 + 15 * img.stride]; 


*extend = val; 


} 


---- End code block 
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20.15. 


predict.h 


+==>="Be91n code block === 220932 2 SH HS 32 


/ 


+ + X F + F F Xo F 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


#ifndef PREDICT H 
#define PREDICT H 


void 
vp8 dixie predict init(struct vp8 decoder ctx "ctx); 


void 
vp8 dixie predict destroy(struct vp8 decoder ctx "ctx); 


void 


vp8 dixie predict process row(struct vp8 decoder ctx *ctx, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols); 


void 
vp8 dixie release ref frame(struct ref cnt img *rcimg); 


struct ref cnt img * 
vp8 dixie ref frame(struct ref cnt img *rcimg); 


struct ref cnt img * 
vp8 dixie find free ref frame(struct ref cnt img *frames); 


Hendif 


==== End: code block SSS SSS 5 SSS SSeS SSS SS SSS === cec 
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20.16. 


tokens.c 


.-c-ocuBegin code block —-s-----2-e--59-9—B-es-enencegstepB-eoeu 


/ 


+ + X £ + Ro F Xo F 


/ 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


finclude "vpx codec internal.h" 
#include "dixie.h" 

#include "tokens.h" 

#include <stdlib.h> 

#include <string.h> 

#include <malloc.h> 


enum 


EOB_CONTEXT_NODE, 
ZERO_CONTEXT_NODE, 
ONE_CONTEXT_NODE, 
LOW_VAL_CONTEXT_NODE, 
TWO_CONTEXT_NODE, 
THREE_CONTEXT_NODE, 
HIGH LOW CONTEXT NODE, 
CAT ONE CONTEXT NODE, 
CAT THREEFOUR CONTEXT NODE, 
CAT THREE CONTEXT NODE, 
CAT FIVE CONTEXT NODE 


enum 


ZERO, TOKEN, 

ONE TOKEN, 

TWO TOKEN, 

THREE TOKEN, 

FOUR TOKEN, 

DCT VAL CATEGORY], 
DCT VAL CATEGORY2, 
DCT VAL CATEGORY3, 
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DCT VAL CATEGORY4, 
DCT VAL CATEGORYS5, 
DCT VAL CATEGORY6, 
DCT EOB TOKEN, 

MAX ENTROPY TOKENS 


y; 


struct extrabits 


{ 
short 
short 
unsigned 
y; 
static const 
{ 
0, 
4, 
y; 
static 
{ 
0, 
4, 
y; 
#define X(n) 
static const 


min_val; 
length; 
char probs[12]; 


unsigned int left_context_index[25] 


((n) * PREV_COEFF_CONTEXTS * ENTROPY_NODES) 
unsigned int bands x[16] 


X(0), X(1), X(2), X(3), X(6), X(4), X(5), X(6), 
X(6), X(6), X(6), X(6), X(6), X(6), X(6), X(7) 
y; 
#undef X 


static const struct extrabits extrabits[MAX ENTROPY TOKENS] 


{ 


(11, 
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( 0, 0, 0, 0, 0, 0, 
0, 0, 0, O; 0, 0 ) |, //ZERO TOKEN 
( 0, 0, 0, 0, 0, 0, 
0, 0, OF 0, 0, O ) ), //ONE TOKEN 
( 0, 0, 0, 0, 0, 0, 
0, o 0, 0, 0, O } ), //TWO TOKEN 
( 0, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, O } ), //THREE TOKEN 
( 0, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, 0 y ), //FOUR TOKEN 
(159, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, 0) ), //DCT VAL CATEGORY1 
(145, 165, 0, 0 0, 0, 
0, 0, 0, 0, 0, 0) ), //DCT VAL CATEGORY2 
(140, 148, 173, 0, 0, 0, 
0, Or 0, O, 0, O } }, //DCT VAL CATEGORY3 
Informational 
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119, 3, (135, 140, 155, 176, 0, 0, 
0, 0, 0, 0, 0, 0) ), //DCT VAL CATEGORY4 
(35, 4, (i130; 134, 1241; d57,- 180; 0, 
0, 0, 0, 0, 0, O } }, //DCT VAL CATEGORY5 
(67, 10, 1129, 130, 133, 140, 153, 177, 
196, 230, 243, 254, 254, 0) ), //DCT VAL CATEGORY6 
0, -1, { 0, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, 0) ), // EOB TOKEN 
); 


static const unsigned int zigzag[16] 


y; 


#define DECODE AND APPLYSIGN(value to sign) N 
v = (bool get bit(bool) ? -value to sign \ 
value to sign) * daqf[!!cl; 


#define DECODE AND BRANCH IF ZERO(probability,branch) N 
if (!bool get(bool, probability)) goto branch; 


#define DECODE AND LOOP IF ZERO(probability,branch) \ 
if (!bool get(bool, probability)) N 
LA 


prob = type_probs; \ 
if (c«15) (^ 
TG; N 
prob += bands x[c]; \ 
goto branch; N 
}\ 
else \ 
goto BLOCK_FINISHED; /* for malformed input */\ 
} 


#define DECODE SIGN WRITE COEFF AND CHECK EXIT(val) \ 
DECODE AND APPLYSIGN(val) N 
prob = type probs + (ENTROPY NODES*2); \ 
if (c < 15) (^ 
b tokens[zigzag[cl] < v; N 
FEGA N 
goto DO WHILE; }\ 
b tokens[zigzag[15]] = v; \ 
goto BLOCK FINISHED; 
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#define DECODE EXTRABIT AND ADJUST VAL(t,bits count)N 
val += bool get(bool, extrabits[t].probs[bits count]) << \ 


bits count; 


static int 

decode mb tokens(struct bool decoder 
token entropy ctx t 
token entropy ctx t 
short 


enum prediction mode 


coeff probs table t 


short 
{ 
int i, stop, type; 
int cp ty Vi 
int val, bits_count; 
int eob_mask; 
short *b tokens; 
unsigned char *type probs; 
unsigned char *prob; 
short *dgf; 
eob mask < 0; 
if (mode != B PRED && mode 
{ 
i = 24; 
stop = 24; 
type = 1; 
b_tokens = tokens + 24 * 16; 
dqf = factor[TOKEN_BLOCK_Y2]; 
} 
else 
{ 
i= 0; 
stop = 16; 
type = 3; 
b_tokens = tokens; 
dqf = factor[TOKEN BLOCK Yl]; 


} 


*bool, 

Teft., 

above, 

*tokens, 

mode, 

probs, 
factor[TOKEN BLOCK TYPES]I[2]1) 


// tokens for this block 
// probabilities for this block type 


!< SPLITMV) 


/* Save a pointer to the coefficient probs for the current type. 
* Need to repeat this whenever type changes. 


*/ 
type probs = probs[type] [0] [0]; 
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BLOCK LOOP: 
t = left[left context index[i]] + above[above context indexli]]; 
c = !type; /* all blocks start at 0 except type 0, which starts 


* at 1. */ 


prob < type probs; 
prob *- t * ENTROPY NODES; 


DO WHILE: 
prob += bands x[c]; 
DECODE AND BRANCH IF ZERO(prob[EOB CONTEXT NODE], 
BLOCK FINISHED); 


CHECK 0 : 
DECODE AND LOOP IF ZERO (prob[ZERO CONTEXT NODE], CHECK 0 ); 
DECODE AND BRANCH IF ZERO(prob[ONE CONTEXT NODE], 

ONE CONTEXT NODE 0 ); 

DECODE AND BRANCH IF ZERO (prob[LOW VAL CONTEXT NODE], 

LOW VAL CONTEXT NODE 0 ); 

DECODE AND BRANCH IF ZERO(prob[HIGH LOW CONTEXT NODE], 

HIGH LOW CONTEXT NODE 0 ); 

DECODE AND BRANCH IF ZERO(prob[CAT THREEFOUR CONTEXT NODE], 
CAT THREEFOUR CONTEXT NODE 0 ); 
DECODE AND BRANCH IF ZERO(prob[CAT FIVE CONTEXT NODE], 

CAT FIVE CONTEXT NODE 0 ); 

val < extrabits[DCT VAL CATEGORY6] .min val; 

bits count = extrabits[DCT VAL CATEGORY6].length; 


do 
{ 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORYO, 
bits. count); 
bits count --; 
} 


while (bits_count >= 0); 


DECODE_SIGN_WRITE_COEFF_AND_CHECK_EXIT (val) ; 


I 


CAT FIVE CONTEXT NODE 0 : 
val < extrabits[DCT VAL CATEGORY5] .min val; 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY5, 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY5, 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY), 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY), 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY5, 
DECODE SIGN WRITE COEFF AND CHECK EXIT (val); 


Se 


Tm 


e 


OPNO Ad 
`~ 


`e 
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CAT THREEFOUR CONTEXT NODE 0 : 

DECODE AND BRANCH IF ZERO(prob[CAT THREE CONTEXT NODE], 
CAT THREE CONTEXT NODE 0 ); 
val < extrabits[DCT VAL CATEGORY4] .min val; 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY4, 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY4, 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY4, 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY4, 
DECODE SIGN WRITE COEFF AND CHECK EXIT (val); 


. e. 


5. 


OE NUW 
`~ 


Se 


CAT_THREE_CONTEXT_NODE_0_: 
val = extrabits[DCT VAL CATEGORY3].min val; 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY3, 2); 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY3, 1); 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY3, 0); 
DECODE SIGN WRITE COEFF AND CHECK EXIT (val); 


HIGH LOW CONTEXT NODE 0 : 
DECODE AND BRANCH IF ZERO(prob[CAT ONE CONTEXT NODE], 
CAT ONE CONTEXT NODE 0 ); 


val < extrabits[DCT VAL CATEGORY2].min val; 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY2, 1); 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY2, 0); 
DECODE SIGN WRITE COEFF AND CHECK EXIT (val); 


CAT ONE CONTEXT NODE 0 : 

val < extrabits[DCT VAL CATEGORY1] .min val; 
DECODE EXTRABIT AND ADJUST VAL(DCT VAL CATEGORY], 0); 
DECODE SIGN WRITE COEFF AND CHECK EXIT (val); 


LOW VAL CONTEXT NODE 0 : 

DECODE AND BRANCH IF ZERO(prob[TWO CONTEXT NODE], 
TWO CONTEXT NODE 0 ); 
DECODE AND BRANCH IF ZERO(prob[THREE CONTEXT NODE], 
THREE CONTEXT NODE 0 ); 
DECODE SIGN WRITE COEFF AND CHECK EXIT (4); 


THREE CONTEXT NODE 0. 
DECODE SIGN WRITE COEFF AND CHECK EXIT(3); 


TWO CONTEXT NODE O : 
DECODE SIGN WRITE COEFF AND CHECK EXIT (2); 


ONE CONTEXT NODE 0 : 
DECODE AND APPLYSIGN(1); 
prob = type probs + ENTROPY NODES; 
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if (c « 15) 

{ 
b_tokens[zigzag[c]] = v; 
ttc; 
goto DO WHILE; 

} 


b_tokens[zigzag[15]] = v; 
BLOCK_FINISHED: 


eob mask |= (c > 1) << i; 

t = (c != !type); // any non-zero data? 

eob_mask |= t << 31; 

left [left_context_index[i]] = above[above_context_index[i]] = t; 


b_tokens += 16; 
itt; 


if (i « stop) 
goto BLOCK LOOP; 


if (i << 25) 
( 
type < 0; 
i= 0; 
stop = 16; 
type probs = probs[type] [0] [0]; 
b tokens - tokens; 


dqf = factor[TOKEN BLOCK Yl]; 
goto BLOCK LOOP; 
} 


if (i == 16) 

{ 
type = 2; 
type_probs = probs[type] [0] [0]; 
stop = 24; 
dqf = factor[TOKEN BLOCK UV]; 
goto BLOCK LOOP; 

} 


return eob mask; 
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static void 
reset row context(token entropy ctx t *left) 
{ 

memset (left, 0, sizeof (*left)); 


} 


static void 
reset_above_context (token_entropy_ctx_t *above, unsigned int cols) 
{ 

memset (above, 0, cols * sizeof (*above)); 


} 


static void 

reset mb context(token entropy ctx t  *left, 
token entropy ctx t  *above, 
enum prediction mode mode) 


/* Reset the macroblock context on the left and right. We have 
* to preserve the context of the second order block if this mode 
* would not have updated it. 


8/ 
memset (left, 0, sizeof((*left)[0]) * 8); 
memset (above, 0, sizeof((*above)[0]) * 8); 
if (mode !- B PRED && mode != SPLITMV) 
{ 
(*left) [8] = 0; 
(*above) [8] = 0; 
} 
} 
void 
vp8 dixie tokens process row(struct vp8_decoder_ctx *ctx, 
unsigned int partition, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols) 
( 
struct token decoder *tokens = &ctx->tokens [partition]; 
short coeffs = tokens->coeffs + 25 * 16 * start col; 
unsigned int col; 
token entropy ctx t "above = ctx-»above token entropy ctx 


+ start col; 
token entropy ctx t "left = &tokens-»left token entropy ctx; 
struct mb info *mbi = ctx-»mb info rows[row] + start col; 
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if (row << 0) 
reset above context (above, num cols); 


if (start col << 0) 
reset row context (left); 


for (col = start col; col < start col + num cols; col++) 


{ 


memset (coeffs, 0, 25 * 16 * sizeof(short)); 


if (mbi->base.skip coeff) 

{ 
reset mb context(left, above, mbi->base.y mode); 
mbi->base.eob_mask = 0; 

} 

else 


{ 


struct dequant factors *dqf; 


dqf = ctx-»dequant factors + mbi->base.segment id; 
mbi->base.eob mask - 
decode mb tokens(&tokens-»bool, 
*left, *above, 
coeffs, 
mbi->base.y mode, 
ctx->entropy hdr.coeff probs, 
dgf->factor); 
} 


abovett; 
mbitt; 
coeffs += 25 * 16; 


void 
vp8 dixie tokens init(struct vp8 decoder ctx *ctx) 


( 


unsigned int partitions - ctx->token hdr.partitions; 


if (ctx->frame hdr.frame size updated) 
{ 
unsigned int i; 
unsigned int coeff_row_sz = 
ctx->mb_cols * 25 * 16 * sizeof(short); 
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for (i = 0; i < partitions; i++) 


free (ctx->tokens[i].coeffs); 


November 2011 


ctx-»tokens[i].coeffs = memalign(16, coeff row sz); 


if (!lctx->tokens[i].coeffs) 


vpx internal error(&ctx-»error, VPX CODEC MEM ERROR, 


NULL); 
} 


free(ctx-»above token entropy ctx); 

ctx->above token entropy ctx - 
calloc(ctx-»mb cols, 
Sizeof(*ctx-»above token entropy ctx)); 


if (!ctx-»above token entropy. ctx) 
vpx internal error (sctx->error, 
VPX CODEC MEM ERROR, NULL); 


void 
vp8  dixie tokens destroy(struct vp8 decoder ctx *ctx) 
{ 
int i; 
for (i = 0; i < MAX PARTITIONS; i++) 
free (ctx->tokens[i].coeffs); 


free(ctx-»above token entropy ctx); 


} 


=o os ENG code block. ==> aS SRS 
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20.17. 


tokens.h 


.-cs-UBegin:code block === 220932 2 SH HS 52 


/ 


+ + X F + F F Xo F 


/ 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


#ifndef TOKENS H 
#define TOKENS H 


void 
vp8 dixie tokens init(struct vp8 decoder ctx "ctx); 


void 
vp8 dixie tokens destroy(struct vp8 decoder ctx "ctx); 


void 


vp8 dixie tokens process row(struct vp8 decoder ctx *ctx, 
unsigned int partition, 
unsigned int row, 
unsigned int start col, 
unsigned int num cols); 


Hendif 


Soh Ena code. block. === mom S IA IAE 
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20.18.  vp8 prob data.h 


.cs—UBegin o code block === 2209321 ooo Se HS 232 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

* 


/ 


static const 
unsigned char k coeff entropy update probs[BLOCK TYPES] [COEFF BANDS] 
[PREV COEFF CONTEXTS] 
[ENTROPY NODES] - 
{ 
{ 


1255,.:255, 255, 255, 255, 255, 255, 255,255, 255, 255], 
[255, 255, 255, 255, 255, 255, 255, 255, 255, 295, 255], 
[25554 255, 255; 255, 255,.255, 255, 255, 255, 255; 2551 


11/767 246, 255, 255, 255, 255, 255, 255, 255, 299, 255]; 
1223, 241, 252, 255, 255, 255, 255,255, 255, 255, 299), 
1249; 253, 253, 255, 255, 255, 255, 255,.255, 255, 255], 


1255, 244, 252, 255, 255, 255, 255, 255, 255, 255, 255H 
1234, 254, 254, 255, 255, 255, 255, 255, 290; 255, 255], 
1253, 255, 255, 255, 255, 255, 255, 255, 255, 255, 299], 


(255, 246, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
(239, 253, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
(254, 255, 254, 255, 255, 255, 255, 255, 255, 255, 255], 


(255; 248, 254, 255, 255, 255, 255, 255, 255, 255, 255], 

1251, 255, 254, 255, 255, 255, 255, 255, 255, 299; 299], 

(259; 2255, 255, 255, 255, 255, 255, 2997 2995y 255, 255], 
), 
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12555 259, 254; 255, 2554, 255, 255, 2997 255, 255, 255F, 
1251, 254,.254, 255, 255, 255, 255, 255, 299, 255, 295], 
(254, 255, 254, 255, 255, 255, 255, 255, 255, 255, 255], 


(299; 254, 293, 255, 254, 255, 255, 255,:2b55;, 2557 255], 
(250, 255, 254, 255, 254, 255, 255, 255, 255, 255, 255], 
1254; 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255,.255, 255, 255, 255, 255, 255, 255, 255, 299, 2955]; 

12555. 255, 255, 255,:255,.255, 255, .255, 255,4, 255, 299], 

1255, 255, 255, 255, 255, 255, 255, 255,255, 255, 255FH 
), 


1217, 255, 255, 255, 255, 299, 255, 255, 255, 255, 299]; 
(225, 252, 241, 253, 255, 255, 254, 255, 255, 255, 255], 
(234, 250, 241, 250, 253, 255, 253, 254, 255, 255, 255], 


1255, 254, 259; 255, 255, 255, 255, 255, 255, 255, 299], 
1223, 254,.254, 255, 255, 255, 255, 255, 255, 255, 299}, 
(238, 253, 254, 254, 255, 255, 255, 255, 255, 255, 255], 


1255, 248, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
(249, 254, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
1255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 253, 255, 255, 255, 255, 255, 255, 255, 255, 299]; 
{2477 254r 2597: 295, 295; 255, 255, ;:255,. 255, 255, 299], 
1255, 255, 2555, 255, 255, 255, 255, 255, 255, 259, 255], 


[255,4 253, 294; 255, 2554, 255, 255, 299; 255, 299,299], 
1252, 255,255, 255, 255, 255, 255, 255, 299, 255, ZI}, 
(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 

1253, 255, 255, 255, 255, 255, 255, 255,.255, 255, 299]; 

1255; 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
), 
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1299; 254, 253, 255, 2554. 255, 255, 2997 255, 299,299), 
1250; “295, 299; 255, 255, 255, 255, 255, 299, 255, 299}, 
(254, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 290, 255, 255, 255, 255, 255, 29097; 290; 255, 255], 

1255, 255, 255; 255, 255,.255, 255, 255,.255, 255, 299]; 

1255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
), 


1186, 251, 250, 255, 255, 255, 255, 255,.255, 299, 255) 
(234, 251, 244, 254, 255, 255, 255, 255, 255, 255, 255], 
(1251; 251, 243; 253, 254, 255, 254, 255, 255, 295, 295]; 


1255, 253, 254, 1255; 255, 255, 255, 255, 255, 255, 299]; 
1236;. 253, . 254, 259, 255, 255, 255, 255, 255, 255, 255], 
(251, 253, 253, 254, 254, 255, 255, 255, 255, 255, 255], 


1255, 254, 254; 255, 255, 255, 255, 255, 255, 255, 299], 
(254, 254, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1255; 255, 255, 255, 255, 255, 255, 255, 255, 255, 255k 


1255, 2254, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
(254, 254, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
1254, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 299]; 
1254, 255, 25574. 255, 255, 255, 255,255, 255, 255, 299], 
1255,. 255, 299% 255, 255, 255, 255, 255, 255, 299, 255], 


[2554 255, 255,. 255, 259; 295; 255, 2997 255, 299,299), 
1255, 255, 255, 255, 255, 255, 255, 255, 299, 255, 255b, 
(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 290, 255, 255, 255, 255, 255, 255, 255, 255, 255], 

1255, 255, 255; 255, 255, 255, 255, 255,.255, 255, 299]; 

1255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
), 
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12554 255, 255,. 255, 259; 295; 299; 2997 255, 299,299), 

1255, 255, 255, 255, 255, 255, 255, 255, 299, 255, ZI}, 

(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
), 


(248, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
(250, 254, 252, 254, 255, 255, 255, 255, 255, 255, 255], 
(248, 254, 249, 253, 255, 255, 255, 255, 255, 255, 255], 


1255, 253,-253, 255p, 255, 255, 255, 255,.255, 299, 295], 
1246, 253, 253, 255, 255, 255, 255, 255, 255, 255, 299, 
(252, 254, 251, 254, 254, 255, 255, 255, 255, 255, 255], 


(255, 254, 252, 255, 255, 255, 255, 255, 255, 255, 255], 
1248, 254, 253, 255, 255, 255, 255, 255, 255, 255, 255], 
1253, 255, 254, 254, 255, 255, 255, 255, 255, 255, 299); 


125554 251, 254; 255, 255, 255, 255, 255, 255, 255, 299], 
(245, 251, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1253, 253, 254, 255, 255, 255, 255, 255, 255, 255, 255, 


(299; 251, 293, 259; 255, 255, 255, 2007; 2990; 255, 255], 
1252, 253, 254, 255, 255, 255, 255, 255, 255, 255, 255], 
1255, 254, 255, 255, 255, 255, 255, 255, 255, 255, 255], 


1255, 252, 255, 255, 255, 255, 255, 255, 255, 255, 299]; 
1249, 255, 254; 255, 255, 255, 255,.255, 255, 255, 299], 
(255, 255, 254, 255, 255, 255, 255, 255, 255, 255, 255], 


[25554 255, 253, 255, 2554 255, 255, 299; 255, 255, 255], 

1250, 255, 255, 255, 255, 255, 255, 255, 299, 255, 255], 

(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255], 
}, 
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(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255), 
(254, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255), 
(255, 255, 255, 255, 255, 255, 255, 255, 255, 255, 255), 
), 
), 
y; 


static const 
unsigned char k default y mode probs [] 
{ 112, 86, 140, 37}; 


static const 
unsigned char k_default_uv_mode_probs [] 
{ 162, 101, 204}; 


static const 

unsigned char k_default_coeff_probs [BLOCK_TYPES] [COEFF_BANDS] 

[PREV COEFF CONTEXTS][ENTROPY NODES] = 

{ 

{ /* block type 0 */ 
{ /* coeff band 0 */ 

{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128}, 
{ 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128} 


{ /* coeff band 1 */ 
{ 253, 136, 254, 255, 228, 219, 128, 128, 128, 128, 128}, 
{ 189, 129, 242, 255, 227, 213, 255, 219, 128, 128, 128}, 
{ 106, 126, 227, 252, 214, 209, 255, 255, 128, 128, 128} 


{ /* coeff band 2 */ 
{ qs 98, 248, 255, 236, 226, 255, 255, 128, 128, 128], 
{ 181, 133, 238, 254, 221, 234, 255, 154, 128, 128, 128}, 
1 78, 134, 202, 247, 198, 180, 255, 219, 128, 128, 128) 


( /* coeff band 3 */ 
1 1, 185, 249, 255, 243, 255, 128, 128, 128, 128, 128), 
{ 184, 150, 247, 255, 236, 224, 128, 128, 128, 128, 128}, 
1 77, 110, 216, 255, 236, 230, 128, 128, 128, 128, 128) 
, 


Bankoski, et al. Informational [Page 247] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


( /* coeff band 4 */ 
{ 1l, 101, 251, 255, 241, 255, 128. 128, 128, 128. 128}, 
{ 170, 139, 241, 252, 236, 209, 255, 255, 128, 128, 128}, 
{ 37, 116, 196, 243, 228, 255, 255, 255, 128, 128, 128} 


{ /* coeff band 5 */ 
{ 1, 204, 254, 255, 245, 255, 128, 128, 128, 128, 128}, 
{ 207, 160, 250, 255, 238, 128, 128, 128, 128, 128, 128}, 
{ 102, 103, 231, 255, 211, 171, 128, 128, 128, 128, 128} 


( /* coeff band 6 */ 
1 1, 152, 252, 255, 240, 255, 128, 128, 128, 128, 128), 
{ 177, 135, 243, 255, 234, 225, 128, 128, 128, 128, 128}, 
1 80, 129, 211, 255, 194, 224, 128, 128, 128, 128, 128) 


( /* coeff band 7 */ 
1 1.5 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
( 246, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
( 255, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128] 
) 
), 
( /* block type 1 */ 
( /* coeff band 0 */ 
{ 198, 35, 237, 223, 193, 187, 162, 160, 145, 155, 62), 
piši; "45, 198; 221, 192.. 176, 220, 1547, 252,221, 135 
1 68, 47, 146, 208, 149, 167, 221, 162, 255, 223, 128) 


( /* coeff band 1 */ 
{ 1, 149, 241, 255, 221, 224, 255, 255, 128, 128, 1281, 
( 184, 141, 234, 253, 222, 220, 255, 199, 128, 128, 128), 
1 81, 99, 181, 242, 176, 190, 249, 202, 255, 255, 128) 


( /* coeff band 2 */ 
1 1, 129, 232, 253, 214, 197, 242, 196, 255, 255, 128), 
1 99, 121, 210, 250, 201, 198, 255, 202, 128, 128, 128), 
1 23, 91, 163, 242, 170, 187, 247, 210, 255, 255, 128) 

, 

( /* coeff band 3 */ 
1 1, 200, 246, 255, 234, 255, 128, 128, 128, 128, 128), 
( 109, 178, 241, 255, 231, 245, 255, 255, 128, 128, 128), 
1 44, 130, 201, 253, 205, 192, 255, 255, 128, 128, 128) 


( /* coeff band 4 */ 
1 1, 132, 239, 251, 219, 209, 255, 165, 128, 128, 128), 
1 94, 136, 225, 251, 218, 190, 255, 255, 128, 128, 128), 
1 22, 100, 174, 245, 186, 161, 255, 199, 128, 128, 128) 
ly 
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( /* coeff 


( /* coeff 
{ 1, 
{ 203, 
{ 137, 
} 
, 


( /* block type 2 */ 


band 
182, 


5.%/ 
249, 
241, 
181, 


255, 
255, 
251, 


232, 
227, 
193, 


235, 
234, 
211, 


6 */ 
247, 
235, 
188, 


255, 
2997 
251, 


236, 
225, 
195, 


231, 
227, 
217, 


7 */ 
251, 
248, 
7]; 


255, 
255, 
255, 


213, 
255, 
224, 


255, 
128, 
255, 


( /* coeff band O */ 


( 253, 


( /* coeff 
{ 1, 
{ 247, 
{ 240, 
ly 
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9, 
137 
13; 


band 
95, 
90, 
YU 


band 


248, 
224, 
ITI, 


251, 
243, 
221, 


207, 
193, 
161, 


208, 
185, 
179, 


JA 
247, 
244, 
195; 


253, 
250, 
248, 


212, 
217, 
188, 


183, 
209, 
195, 


25 E 
239, 
219, 
190, 


251, 
255, 
239, 


218, 
196, 
201, 


219, 
186, 
218, 


3*7 
251, 
249, 
248, 


255, 
255, 
255, 


255, 
213, 
255, 


128, 
255, 
128, 


4 */ 
248, 
230, 
255, 


255, 
255, 
128, 


255, 
236, 
128, 


128, 
255, 
128, 


5 %/ 
255, 
255, 
255, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


Informational 


128, 
128, 
255, 


255, 
255, 
299, 


128, 
128, 
128, 


255, 
249, 
236, 


255, 
255, 
255, 


255, 
128, 
255, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 
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128, 
128, 
205, 


255, 
255, 
224, 


128, 
128, 
128, 


192; 
198, 
167, 


2555 
255, 
255, 


205, 
128, 
228, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 
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128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
255, 
255, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
255, 
234, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 


128, 
128, 
128, 


128], 
128], 
128] 
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( /* coeff band 6 */ 
1 1, 134, 252, 255, 255, 128, 128, 128, 128, 128, 128), 
{ 213, 62, 250, 255, 255, 128, 128, 128, 128, 128, 128), 
1 DD 93, 255, 128, 128, 128, 128, 128, 128, 128, 128) 


( /* coeff band 7 */ 
[| 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128}, 
( 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128], 
[| 128, 128, 128, 128, 128, 128, 128, 128, 128, 128, 128} 
) 
- 
( /* block type 3 */ 
( /* coeff band 0 */ 
{ 202, 24, 213, 235, 186, 191, 220, 160, 240, 175, 255), 
{ 126, 38, 182, 232, 169, 184, 228, 174, 255, 187, 128), 
1 61, 46, 138, 219, 151, 178, 240, 170, 255, 216, 128) 


( /* coeff band 1 */ 
1 1, 112, 230, 250, 199, 191, 247, 159, 255, 255, 128), 
{ 166, 109, 228, 252, 211, 215, 255, 174, 128, 128, 128}, 
{ 39, 77, 162, 232, 172, 180; 245, 178, 255, 255, 128) 


( /* coeff band 2 */ 
1 1, 52, 220, 246, 198, 199, 249, 220, 255, 255, 128), 
[| 124, 74, 191, 243, 183, 193, 250, 221, 255, 255, 128), 
1 24, 71, 130, 219, 154, 170, 243, 182, 255, 255, 128) 


( /* coeff band 3 */ 
1 1, 182, 225, 249, 219, 240, 255, 224, 128, 128, 128), 
( 149, 150, 226, 252, 216, 205, 255, 171, 128, 128, 128), 
1 28, 108, 170, 242, 183, 194, 254, 223, 255, 255, 128) 


( /* coeff band 4 */ 
1 JA 81, 230, 252, 204, 203, 255, 192, 128, 128, 128), 
{ 123, 102, 209, 247, 188, 196, 255, 233, 128, 128, 128}, 
1 20, 95, 153, 243, 164, 173, 255, 203, 128, 128, 128) 


( /* coeff band 5 */ 
1 1, 222, 248, 255, 216, 213, 128, 128, 128, 128, 128), 
{ 168, 175, 246, 252, 235, 205, 255, 255, 128, 128, 128}, 
{ 47, 116, 215, 255, 211. 212. 255, 255, 129, 129, 128] 


( /* coeff band 6 */ 
1 1, 121, 236, 253, 212, 214, 255, 255, 128, 128, 128), 
{ 141, 84, 213, 252, 201, 202, 255, 219, 128, 128, 128}, 
{ 42, 80, 160, 240, 162, 185, 255, 205, 128, 128, 128} 
ly 
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( /* coeff band 7 */ 
1 1, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
{ 244, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128), 
( 238, 1, 255, 128, 128, 128, 128, 128, 128, 128, 128) 


y; 


static const 
unsigned char k mv entropy update probs[2][MV PROB CNT] = 
{ 
{ 

237, 

246, 

253, 253, 254, 254, 254, 254, 254, 

254, 254, 254, 254, 254, 250, 250, 252, 254, 254 


231, 

243, 

245, 253, 254, 254, 254, 254, 254, 

254, 254, 254, 254, 254, 251, 251, 254, 254, 254 


y; 
static const 


unsigned char k default mv probs[2][MV PROB CNT] = 
{ 


{ // row 
162, // is short 
128, // sign 
225, 146, 172, 147, 214, 39, 156, // short tree 


128, 129, 132, 75, 145, 178, 206, 239, 254, 254 // long bits 


164, 

128, 

204, 170, 119, 235, 140, 230, 228, 

128, 130, 130, 74, 148, 180, 203, 236, 254, 254 


y; 


===> ¿End Code: block, ===5======35==3 === ss eS SS SoS ss 
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20.19. 


vpx codec internal.h 


rtm Begin code block === 2224522220932 2 SH HS 52 


/ 


+ + X Xo + Ro F F F 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


/*!Nfile vpx codec internal.h 


+ + A 0X 0X £ | | 0X 0X F FF OR OR ooo ook 0 0 0 0 0 0 F OR OR Ro OF 


\brief Describes the decoder algorithm interface for algorithm 
implementations. 


This file defines the private structures and data types that are 
only relevant to implementing an algorithm, as opposed to using 
it. 


To create a decoder algorithm class, an interface structure is put 
into the global namespace: 


«pre» 
my codec.c: 
vpx. codec iface t my codec = { 
"My Codec v1.0", 
VPX CODEC ALG ABI VERSION, 
y; 
</pre> 


An application instantiates a specific decoder instance by using 


vpx_codec_init() and a pointer to the algorithm's interface 
structure: 

<pre> 

my_app.c: 


extern vpx_codec_iface_t my_codec; 


{ 


vpx_codec_ctx_t algo; 
res = vpx codec init(&algo, &my codec); 
} 
</pre> 
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* Once initialized, the instance is managed using other functions 
* from the vpx codec * family. 

E 

#ifndef VPX CODEC INTERNAL H 

#define VPX CODEC INTERNAL H 

finclude "vpx decoder.h" 

finclude <stdarg.h> 


/*!Nbrief Current ABI version number 


\internal 

If this file is altered in any way that changes the Application 
Binary Interface (ABI), this value must be bumped. Examples 
include, but are not limited to, changing types, removing or 
reassigning enums, adding/removing/rearranging fields to 

* structures. 

*/ 
#define VPX CODEC INTERNAL ABI VERSION (3) 


MALE A. P. 


typedef struct vpx codec alg priv  vpx codec alg priv t; 


/*!\brief init function pointer prototype 


* 

* Performs algorithm-specific initialization of the decoder context. 
* This function is called by the generic vpx codec init() wrapper 
* function, so plugins implementing this interface may trust the 
* input parameters to be properly initialized. 

* 

* Nparam[in] ctx Pointer to this instance's context 

* \retval $VPX CODEC OK 

* The input stream was recognized and decoder initialized. 

x Nretval £VPX CODEC MEM ERROR 

* Memory operation failed. 

7 


typedef vpx codec err t ("vpx codec init fn t) (vpx codec ctx t "ctx); 
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/*!\brief destroy function pointer prototype 


* 

* Performs algorithm-specific destruction of the decoder context. 
* This function is called by the generic vpx codec destroy() wrapper 
* function, so plugins implementing this interface may trust the 
* input parameters to be properly initialized. 

* 

* Nparam[in] ctx Pointer to this instance's context 

* \retval $VPX CODEC OK 

še The input stream was recognized and decoder initialized. 

x Nretval £VPX CODEC MEM ERROR 

i Memory operation failed. 

X% 


typedef vpx codec err t (*vpx codec destroy fn t)( 
vpx codec alg priv t *ctx); 


/*!\brief parse stream info function pointer prototype 


size member \ref MUST be properly 
initialized, but \ref MAY be clobbered by 
the algorithm. This parameter \ref MAY 
be NULL. 


* 

* Performs high level parsing of the bitstream. This function is 
* called by the generic vpx codec parse stream() wrapper function, 
* so plugins implementing this interface may trust the input 

* parameters to be properly initialized. 

* 

* Nparam[in] data Pointer to a block of data to parse 

* Nparam[in] data sz Size of the data buffer 

x Nparam[in,out] si Pointer to stream info to update. The 
* 

* 

* 

* 

* 

* 


\retval 4VPX CODEC OK 
Bitstream is parsable and stream information updated 


+ 


zi 
typedef vpx codec err t (*vpx codec peek si fn t)( 
const uint8 t *data, 
unsigned int data_sz, 


Vpx codec stream info t *si); 
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/ * 
* 


* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


!'\brief Return information about the current stream. 


Returns information about the stream that has been parsed during 
decoding. 

\param[in] ctx Pointer to this instance’s context 
\param[in,out] si Pointer to stream info to update. The 


size member \ref MUST be properly 
initialized, but \ref MAY be clobbered by 
the algorithm. This parameter \ref MAY 
be NULL. 


\retval #VPX_CODEC_OK 
Bitstream is parsable and stream information updated 


/ 


typedef vpx_codec_err_t (*vpx_codec_get_si_fn_t) ( 


/* 


+ + + + + oo | | CACA FF FF OR OR F 0b 6 F KF X 


vpx_codec_alg_priv_t *CEX, 
vpx_codec_stream_info_t *si); 


!\brief control function pointer prototype 


This function is used to exchange algorithm-specific data with the 
decoder instance. This can be used to implement features specific 
to a particular algorithm. 


This function is called by the generic vpx_codec_control() wrapper 
function, so plugins implementing this interface may trust the 
input parameters to be properly initialized. However, this 
interface does not provide type safety for the exchanged data or 
assign meanings to the control codes. Those details should be 
specified in the algorithm’s header file. In particular, the 
ctrl_id parameter is guaranteed to exist in the algorithm’s 
control mapping table, and the data parameter may be NULL. 


\param[in] ctx Pointer to this instance’s context 
\param[in] ctrl_id Algorithm-specific control identifier 
\param[in,out] data Data to exchange with algorithm instance. 


\retval #VPX_CODEC_OK 
The internal state data was deserialized. 


/ 


typedef vpx_codec_err_t (*vpx_codec_control_fn_t) ( 


Banko 


vpx codec alg priv t  *ctx, 
int ctrl id, 
va list ap); 
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/*!Nbrief control function pointer mapping 


This structure stores the mapping between control identifiers and 
implementing functions. Each algorithm provides a list of these 
mappings. This list is searched by the vpx codec control() 
wrapper function to determine which function to invoke. The 
Special value (0, NULL} is used to indicate end-of-list, and must 
be present. The special value (0, <non-null>) can be used as a 
catch-all mapping. This implies that ctrl id values chosen by the 
algorithm \ref MUST be non-zero. 

/ 

typedef const struct 

{ 


+ + + + ok oo xo xot x 


int ctrl id; 
Vpx codec control fn t fn; 
) vpx codec ctrl fn map t; 


/*!\brief decode data function pointer prototype 


Processes a buffer of coded data. If the processing results in a 
new decoded frame becoming available, $VPX CODEC CB PUT SLICE and 
4VPX CODEC CB PUT FRAME events are generated as appropriate. 

This function is called by the generic vpx codec decode() wrapper 
function, so plugins implementing this interface may trust the 
input parameters to be properly initialized. 


\param[in] ctx Pointer to this instance’s context 
\param[in] data Pointer to this block of new coded data. 
If NULL, a 4VPX CODEC CB PUT FRAME event is 
posted for the previously decoded frame. 
Nparam[in] data sz Size of the coded data, in bytes. 


\return Returns 4VPX CODEC OK if the coded data was processed 
completely and future pictures can be decoded without 
error. Otherwise, see the descriptions of the other error 
codes in ::vpx codec err t for recoverability 
capabilities. 


+ + X 0X + o Xo | F FF 0 0 0 0 Ro Ro o X 


/ 

typedef vpx codec err t (*vpx codec decode fn t)( 
vpx codec alg priv t  *ctx, 

const uint8 t *data, 

unsigned int data sz, 

void *user priv, 

long deadline); 
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/*!\brief Decoded frames iterator 


Iterates over a list of the frames available for display. The 
iterator storage should be initialized to NULL to start the 
iteration. Iteration is complete when this function returns NULL. 


The list of available frames becomes valid upon completion of the 
vpx codec decode call, and remains valid until the next call to 
vpx_codec_decode. 


\param[in] ctx Pointer to this instance’s context 
\param[in out] iter Iterator storage, initialized to NULL 


\return Returns a pointer to an image, if one is ready for 
display. Frames produced will always be in PTS 
(presentation time stamp) order. 


+ + + + + ok oo ook F F F F Ro X 


/ 

typedef vpx image t*(*vpx codec get frame fn t)( 
vpx codec alg priv t *ctx, 

vpx codec iter t *iter); 


/*Nbrief External Memory Allocation memory map get iterator 


* Iterates over a list of the memory maps requested by the decoder. 
* The iterator storage should be initialized to NULL to start the 
* iteration.  Iteration is complete when this function returns NULL. 
* 
* Nparam[in out] iter Iterator storage, initialized to NULL 
* 
* Nreturn Returns a pointer to a memory segment descriptor, or NULL 
k to indicate end-of-list. 
8] 
typedef vpx codec err t ("vpx codec get mmap fn t)( 
const vpx codec ctx t *CEX; 
Vpx codec mmap t *mmap, 
vpx codec iter t *iter); 
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/*Nbrief External Memory Allocation memory map set iterator 
Sets a memory descriptor inside the decoder instance. 


\param[in] ctx Pointer to this instance’s context 
\param[in] mmap Memory map to store. 


Nretval #VPX_CODEC_OK 

The memory map was accepted and stored. 
\retval #VPX_CODEC_MEM_ERROR 

The memory map was rejected. 


+ + + + + oo Xo Xo F F X 


/ 

typedef vpx_codec_err_t (*vpx_codec_set_mmap_fn_t) ( 
vpx_codec_ctx_t *ctx, 

const vpx codec mmap t  *mmap); 


typedef vpx codec err t (*vpx codec encode fn t)( 
vpx codec alg priv t  *ctx, 


const vpx image t *img, 

Vpx codec pts t pts, 
unsigned long duration, 
Vpx enc frame flags t flags, 
unsigned long deadline); 


typedef const vpx codec cx pkt t*(*vpx codec get cx data fn t)( 
vpx codec alg priv t *ctx, 
vpx codec iter t *iter); 


typedef vpx codec err t 
(*vpx codec enc config set fn t)( 
vpx codec alg priv t *CEX, 
const vpx codec enc cfg t  "cfg); 
typedef vpx fixed buf t * 
(*vpx codec get global headers fn t) (vpx codec alg priv t *ctx); 


typedef vpx image t * 
(*vpx codec get preview frame fn t) (vpx codec alg priv t PECX); 


/*!\brief usage configuration mapping 

* 

This structure stores the mapping between usage identifiers and 
configuration structures. Each algorithm provides a list of these 
mappings. This list is searched by the 

vpx codec enc config default() wrapper function to determine which 


* xoxo X 
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* config to return. The special value {-1, {0}} is used to indicate 
* end-of-list, and must be present. At least one mapping must be 

* present, in addition to the end-of-list. 
* 
* 


/ 
typedef const struct 
{ 
int usage; 
Vpx codec enc cfg t cfg; 
) vpx codec enc cfg map t; 


#define NOT IMPLEMENTED 0 


/*!\brief Decoder algorithm interface 
* 


x All decoders \ref MUST expose a variable of this type. 
+y 
struct vpx_codec_iface 


{ 


const char *name; 
int abi version; 
vpx codec caps t caps; 
vpx_codec_init_fn_t init; 
vpx_codec_destroy_fn_t destroy; 
Vpx codec ctrl fn map t  "ctrl maps; 
vpx codec get mmap fn t get mmap; 
vpx codec set mmap fn t set mmap; 
struct 
Vpx codec peek si fn t peek si; 
vpx codec get si fn t get si; 
Vpx codec decode fn t decode; 
Vpx codec get frame fn t get frame; 
) dec; 
struct 
{ 
vpx_codec_enc_cfg_map_t *cfg_maps; 
vpx_codec_encode_fn_t encode; 
vpx_codec_get_cx_data_fn_t get cx data; 
Vpx codec eno config set fn t cfg set; 
Vpx codec get global headers fn t get glob hdrs; 
Vpx codec get preview frame fn t get preview; 
) enc; 


y; 
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/*!\brief Callback function pointer / user data pair storage */ 
typedef struct vpx codec priv cb pair 


{ 


union 
{ 
vpx_codec_put_frame_cb_fn_t put_frame; 
vpx_codec_put_slice_cb_fn_t put_slice; 
y; 
void *user_priv; 


) vpx codec priv cb pair t; 


/*!Nbrief Instance private storage 


This structure is allocated by the algorithm's init function. It 
can be extended in one of two ways. First, a second, algorithm 
specific structure can be allocated and the priv member pointed to 
it. Alternatively, this structure can be made the first member of 
the algorithm-specific structure, and the pointer casted to the 

* proper type. 

AY 
struct vpx_codec_priv 


{ 


* xoxo Ro xo > 


unsigned int SZ; 
vpx_codec_iface_t *iface; 
struct vpx_codec_alg_priv *alg_priv; 
const char *err detail; 
vpx_codec_flags_t init_flags; 
struct 
{ 
vpx codec priv cb pair t put frame cb; 
vpx. codec priv cb pair t put slice cb; 
dec; 
struct 
struct vpx fixed buf cx data dst buf; 
unsigned int cx data pad before; 
unsigned int cx data pad after; 
Vpx codec cx pkt t cx data pkt; 
) enc; 


y; 


fundef VPX CTRL USE TYPE 

#define VPX CTRL USE TYPE(id, typ) \ 
static typ id##__ value (va_list args) \ 
(return va arg(args, typ);) N 
static typ id##__convert (void "x)A 
{\ 
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union\ 
{\ 
void *x;\ 
typ d;\ 
) u;N 
u.X = x;\ 


return u.d; \ 


#undef VPX CTRL USE TYPE DEPRECATED 


#define VPX CTRL USE TYPE DEPRECATED(id, typ) 


static typ id##__ value (va_list args) \ 
(return va arg(args, typ);} \ 
static typ id##__convert (void *x) \ 


{\ 
union\ 
{\ 
void *x;\ 
typ d;\ 
) AEN 
u.X = x;\ 


return u.d;\ 


} 


#define CAST(id, arg) id##__value (arg) 
#define RECAST (id, x) id##__convert (x) 


/* Internal Utility Functions 
* 


* 
* 


El 
struct vpx_codec_pkt_list 
{ 
unsigned int cnt; 
unsigned int max; 


struct vpx codec cx pkt pkts[1]; 
y; 


define vpx codec pkt list decl(n)N 
union (struct vpx codec pkt list head;A 
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N 


struct {struct vpx codec pkt list head;A 


struct vpx codec cx pkt 


Informational 


pkts[n];) alloc;) 


November 2011 


The following functions are intended to be used inside algorithms 
as utilities for manipulating vpx codec * data structures. 


[Page 261] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


#define vpx codec pkt list init (m) \ 
(m) ->alloc.head.cnt < 0,N 
(m) ->alloc.head.max < N 
sizeof((m)->alloc.pkts) / sizeof((m)->alloc.pkts[0]) 


int 
vpx codec pkt list add(struct vpx codec pkt list *, 
const struct vpx codec cx pkt *); 


const vpx codec cx pkt t* 
vpx codec pkt list get(struct vpx codec pkt list "list, 
vpx codec iter t *iter); 


finclude <stdio.h> 

finclude <setjmp.h> 

struct vpx internal error info 
( 


Vpx codec err t error code; 


int has, detail; 
char detail[80]; 
int setjmp; 

jmp buf jmp; 


y; 


static void vpx_internal_error(struct vpx_internal_error_info *info, 
Vpx codec err t error, 
const char *fmt, 


.) 


va_list ap; 


info->error_code = error; 
info->has_detail 0; 


if (fmt) 
{ 


size t sz = sizeof (info->detail); 


info->has_detail = 1; 

va_start(ap, fmt); 
vsnprintf(info->detail, sz - 1, fmt, ap); 
va end(ap); 

info->detail[sz-1] = ’\0’; 
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20 


if (info->setjmp) 
longjmp (info->jmp, info->error code); 
} 
fendif 


tease! MS Yap 


.20. vpx decoder.h 


-.-c-- Begin code block ======="======2 === === 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
x tree. An additional intellectual property rights grant can be 

x found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

El 


/*!Ndefgroup decoder Decoder Algorithm Interface 

\ingroup codec 

This abstraction allows applications using this decoder to easily 
support multiple video formats with minimal code duplication. 
This section describes the interface common to all decoders. 

@{ 

/ 


wo ME NP GE E 


/*!\file vpx decoder.h 
\brief Describes the decoder algorithm interface to applications. 


This file describes the interface between an application and a 
video decoder algorithm. 


+ xoxo DR x > 


/ 

#ifdef cplusplus 
extern "C" { 

#endif 

#ifndef VPX_DECODER_H 
#define VPX_DECODER_H 
#include "vpx codec.h" 
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'\brief Current ABI version number 


\internal 

If this file is altered in any way that changes the ABI, this 
value must be bumped. Examples include, but are not limited 
to, changing types, removing or reassigning enums, 
adding/removing/rearranging fields to structures 


El, 
#define VPX DECODER ABI VERSION (2 + VPX CODEC ABI VERSION) 


/ 


+ + xo + X Ro Xo Xo X 


Y 


#defin 
ca 
#defin 
ca 
#defin 
fr 


/ 


+ + £ Xo Xo HF > 


#defin 
fr 


/ * 


+ xoxo DR xo X 
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\brief Decoder capabilities bitfield 


Each decoder advertises the capabilities it supports as part 
of its ::vpx_codec_iface_t interface structure. Capabilities 
are extra interfaces or functionality, and are not required 
to be supported by a decoder. 


The available flags are specified by VPX_CODEC_CAP_* defines. 


e VPX CODEC CAP PUT SLICE  0x10000 /**« Will issue put slice 
llbacks */ 

e VPX CODEC CAP PUT FRAME  0x20000 /**< Will issue put frame 
llbacks */ 

e VPX CODEC CAP POSTPROC 0x40000 /**« Can postprocess decoded 
ame */ 


! \brief Initialization-time Feature Enabling 


Certain codec features must be known at initialization time, 
to allow for proper memory allocation. 


The available flags are specified by VPX CODEC USE * defines. 


/ 
e VPX CODEC USE POSTPROC 0x10000 /**« Postprocess decoded 


ame */ 

'\brief Stream properties 

This structure is used to query or set properties of the 
decoded stream. Algorithms may extend this structure with 


data specific to their bitstream by setting the sz member 
appropriately. 
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typedef struct vpx codec stream info 


( 


unsigned int sz; /**« Size of this structure */ 
unsigned int w; /**« Width (or 0 for unknown/default) */ 
unsigned int h; /**« Height (or 0 for unknown/default) */ 


unsigned int is kf; /**« Current frame is a keyframe */ 
) vpx codec stream info t; 


/* REQUIRED FUNCTIONS 
* 


The following functions are required to be implemented for all 
* decoders. They represent the base case functionality expected 
of all decoders. 


Ef 


/*!Nbrief Initialization Configurations 
* 
* This structure is used to pass init time configuration options 
* to the decoder. 
*/ 
typedef struct vpx codec dec cfg 
{ 
unsigned int threads; /**< Maximum number of threads to use, 
default 1 */ 
unsigned int w; /**< Width */ 
unsigned int h; /**< Height */ 
} vpx codec dec cfg t; /**« alias for struct vpx codec dec cfg */ 


/*!Nbrief Initialize a decoder instance 


* 

* Initializes a decoder context using the given interface. 

* Applications should call the vpx codec dec init convenience 

* macro instead of this function directly, to ensure that the 

* ABI version number parameter is properly initialized. 

* 

* In XMA mode (activated by setting VPX CODEC USE XMA in the 

* flags parameter), the storage pointed to by the cfg parameter 
* must be kept readable and stable until all memory maps have 

* been set. 

* 

* Nparam[in] ctx Pointer to this instance's context. 

* Nparam[in] iface Pointer to the algorithm interface to 

* use. 

* Nparam[in] cfg Configuration to use, if known. May be 
* NULL. 
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\param[in] 
\param[in] 


flags Bitfield of VPX_CODEC_USE_* flags 
ver ABI version number. Must be set to 
VPX_DECODER_ABI_VERSION 


\retval #VPX_CODEC_OK 

The decoder algorithm initialized. 
\retval #VPX_CODEC_MEM_ERROR 

Memory allocation failed. 


vpx codec err t vpx codec dec init ver( 


vpx codec ctx t OCX 
vpx_codec_iface_t *iface, 
Vpx codec dec cfg t *cfg, 
vpx codec flags t flags, 
int ver); 


/*!\brief Convenience macro for vpx codec dec init ver() 


* 


* Ensures the ABI version parameter is properly set. 


*/ 


#define vpx codec dec init(ctx, iface, cfg, flags) \ 
vpx codec dec init ver(ctx, iface, cfg, flags, N 
VPX DECODER ABI VERSION) 


/*!\brief Parse stream info from a buffer 


* 


+ + x X + Xo | | F Xo o Ro Ro xo Xo OF 
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Performs high level parsing of the bitstream. Construction of 
a decoder context is not necessary. Can be used to determine 
if the bitstream is of the proper format, and to extract 
information from the stream. 


\param[in] 
\param[in] 
\param[in] 


iface Pointer to the algorithm interface 
data Pointer to a block of data to parse 
data_sz Size of the data buffer 


\param[in,out] si Pointer to stream info to update. The 


size member 

Xref MUST be properly initialized, but 
\ref MAY be clobbered by the 
algorithm. This parameter \ref MAY be 
NULL. 


\retval #VPX_CODEC_OK 
Bitstream is parsable and stream information updated 
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Vpx codec err t vpx codec peek stream info( 


/ * 


* 


* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


vpx codec iface t *iface, 
const uint8 t *data, 
unsigned int data_sz, 


Vpx codec stream info t *si); 
!\brief Return information about the current stream. 


Returns information about the stream that has been parsed 
during decoding. 


\param[in] ctx Pointer to this instance’s context 

\param[in,out] si Pointer to stream info to update. The 
size member \ref MUST be properly 
initialized, but \ref MAY be clobbered 
by the algorithm. This parameter \ref 
MAY be NULL. 


\retval #VPX_CODEC_OK 
Bitstream is parsable and stream information updated 


/ 


vpx_codec_err_t vpx_codec_get_stream_info ( 


/* 


+ + + + oko ok | 0 FF E 0 OR OR OR F F HF X 
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vpx_codec_ctx_t *ctx, 
vpx_codec_stream_info_t *si); 


!Nbrief Decode data 

Processes a buffer of coded data. If the processing results 

in a new decoded frame becoming available, PUT_SLICE and 

PUT_FRAME events may be generated, as appropriate. Encoded 

data \ref MUST be passed in DTS (decode time stamp) order. 

Frames produced will always be in PTS (presentation time 

stamp) order. 

\param[in] ctx Pointer to this instance’s context 

\param[in] data Pointer to this block of new coded 
data. If NULL, a 
VPX CODEC CB PUT FRAME event is posted 
for the previously decoded frame. 

\param[in] data sz Size of the coded data, in bytes. 

\param[in] user priv Application-specific data to associate 
with this frame. 

\param[in] deadline Soft deadline the decoder should 
attempt to meet, in us. Set to zero 
for unlimited. 
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x Nreturn Returns 4VPX CODEC OK if the coded data was processed 

x completely and future pictures can be decoded without 

x error. Otherwise, see the descriptions of the other 

ki error codes in ::vpx codec err t for recoverability 

x capabilities. 

x 

vpx codec err t vpx codec decode (vpx codec ctx t *ctx, 

const uint8 t *data, 
unsigned int data sz, 
void *user priv, 
long deadline); 


/*!Nbrief Decoded frames iterator 


* 


+ + + + + CACA ooo 06 06 o6 o Ro X 


Iterates over a list of the frames available for display. The 
iterator storage should be initialized to NULL to start the 
iteration.  Iteration is complete when this function returns 
NULL. 


The list of available frames becomes valid upon completion of 
the vpx codec decode call, and remains valid until the next 
call to vpx codec decode. 


\param[in] ctx Pointer to this instance’s context 
\param[in,out] iter Iterator storage, initialized to NULL 


\return Returns a pointer to an image, if one is ready for 
display. Frames produced will always be in PTS 
(presentation time stamp) order. 


vpx image t *vpx codec get frame(vpx codec ctx t  *ctx, 


vpx. codec iter t "iter); 


/*!Ndefgroup cap put frame Frame-Based Decoding Functions 


+ + x Xo + F Xo X 
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The following functions are required to be implemented for all 
decoders that advertise the VPX CODEC CAP PUT FRAME 
capability. Calling these functions for codecs that don't 
advertise this capability will result in an error code being 
returned, usually VPX CODEC ERROR 

@{ 
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/*!\brief put frame callback prototype 


* 


* 


* 


* 


/ 


This callback is invoked by the decoder to notify the 
application of the availability of decoded image data. 


typedef void (*vpx codec put frame cb fn t)( 


void *user priv, 
const vpx image t *img); 


/*!Nbrief Register for notification of frame completion. 


+ + + + + + o F F F F F X 


S 


Registers a given function to be called when a decoded frame 
is available. 


\param[in] ctx Pointer to this instance’s context 
\param[in] cb Pointer to the callback function 
\param[in] user_priv User’s private data 


\retval #VPX_CODEC_OK 
Callback successfully registered. 

\retval #VPX_CODEC_ERROR 
Decoder context not initialized, or algorithm not capable 
of posting slice completion. 


Vpx codec err t vpx codec register put frame cb( 
Vpx codec ctx t RCE, 
Vpx codec put frame cb fn t cb, 
void *user priv); 


/*!@} - end defgroup cap put frame */ 


/*!Ndefgroup cap put slice Slice-Based Decoding Functions 


+ + + + + F HF F 


/ 


The following functions are required to be implemented for all 
decoders that advertise the VPX_CODEC_CAP_PUT_SLICE 
capability. Calling these functions for codecs that don’t 
advertise this capability will result in an error code being 
returned, usually VPX_CODEC_ERROR 

@{ 


/*!\brief put slice callback prototype 


* 
* 
* 
* 
* 
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/ 


This callback is invoked by the decoder to notify the 
application of the availability of partially decoded image 
data. 
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typedef void (*vpx codec put slice cb fn t)( 


[RI 


+ + + Xo + o Xo F F F xo 


+ 


*/ 


void *user priv, 

const vpx image t *img, 
const vpx image rect t *valid, 
const vpx image rect t *update); 


\brief Register for notification of slice completion. 


Registers a given function to be called when a decoded slice 
is available. 


\param[in] ctx Pointer to this instance’s context 
\param[in] cb Pointer to the callback function 
\param[in] user_priv User’s private data 


\retval #VPX_CODEC_OK 
Callback successfully registered. 

\retval #VPX_CODEC_ERROR 
Decoder context not initialized, or algorithm not capable 
of posting slice completion. 


Vpx codec err t vpx codec register put slice cb( 
Vpx codec ctx t *CEX; 
vpx codec put slice cb fn t cb, 
void user priv); 


/*!@} — end defgroup cap put slice*/ 
/*!@} - end defgroup decoder*/ 
fendif 
#ifdef _ cplusplus 
ERE 
Hif !defined(VPX CODEC DISABLE COMPAT) | | !VPX CODEC DISABLE COMPAT 


finclude "vpx decoder compat.h" 


Hendif 


Toe BNGVCOGe: DOCK == A ee Se eee 
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20.21 


Vpx decoder compat.h 


“m= Begin code block =====222==45222 220232125 == 


/ 


+ + X Xo Ro Ro F F F 


/ 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


/*!Ndefgroup decoder Common Decoder Algorithm Interface 


* 


* 


* 


* 


This abstraction allows applications using this decoder to easily 
support multiple video formats with minimal code duplication. 
This section describes the interface common to all codecs. 

@{ 


El 
/*!\file 
* \brief Provides a compatibility layer between version 1 and 2 of 
* this API. 
* 
* This interface has been deprecated. Only existing code should 
* make use of this interface, and therefore, it is only thinly 
* documented. Existing code should be ported to the vpx codec * 
TOABRI. 
* 
#ifdef cplusplus 


extern "C" ( 
dendif 


#ifndef VPX DECODER COMPAT H 
#define VPX DECODER COMPAT H 
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/*!Nbrief Decoder algorithm return codes */ 
typedef enum ( 
/*!Nbrief Operation completed without error */ 
VPX DEC OK = VPX CODEC OK, 


/*!\brief Unspecified error */ 
VPX_DEC_ERROR = VPX_CODEC_ERROR, 


/*!Nbrief Memory operation failed */ 
VPX_DEC_MEM ERROR = VPX_CODEC_MEM ERROR, 


/*!\brief ABI version mismatch */ 
VPX DEC ABI MISMATCH = VPX CODEC ABI MISMATCH, 


/*!\brief The given bitstream is not supported. 
* 


* The bitstream was unable to be parsed at the highest 


* level. The decoder is unable to proceed. This error \ref 
* SHOULD be treated as fatal to the stream. 
biči 


VPX DEC UNSUP BITSTREAM < VPX CODEC UNSUP BITSTREAM, 


/*!\brief Encoded bitstream uses an unsupported feature 


The decoder does not implement a feature required by the 
encoder. This return code should only be used for 
features that prevent future pictures from being properly 
decoded. This error \ref MAY be treated as fatal to the 
stream or \ref MAY be treated as fatal to the current 

* Group of Pictures (GOP). 

*/ 
VPX DEC UNSUP FEATURE - VPX CODEC UNSUP FEATURE, 


+ + + D F 


/*!Nbrief The coded data for this stream is corrupt or 
incomplete 


There was a problem decoding the current frame. This 
return code should only be used for failures that prevent 
future pictures from being properly decoded. This error 
Xref MAY be treated as fatal to the stream or \ref MAY be 
treated as fatal to the current GOP. If decoding is 
* continued for the current GOP, artifacts may be present. 
87 
VPX DEC CORRUPI FRAME - VPX CODEC CORRUPT FRAME, 


D p EE E UD NP z 
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/*!\brief An application-supplied parameter is not valid. 
* 
Fl 

VPX DEC INVALID PARAM = VPX CODEC INVALID PARAM, 


/*!Nbrief An iterator reached the end of list. 
* 
* 

VPX DEC LIST END - VPX CODEC LIST END 


vpx dec err t; 


/ 


*/ 


\brief Decoder capabilities bitfield 


Each decoder advertises the capabilities it supports as part 
of its ::vpx_dec_iface_t interface structure. Capabilities 
are extra interfaces or functionality, and are not required 
to be supported by a decoder. 


The available flags are specified by VPX_DEC_CAP_* defines. 


typedef int vpx_dec_caps_t; 
#define VPX_DEC_CAP_PUT_SLICE 0x0001 /**< Will issue put_slice 


callbacks */ 


#define VPX_DEC_CAP_PUT_FRAME 0x0002 /**< Will issue put_frame 


callbacks */ 


#define VPX_DEC_CAP_XMA 0x0004 /**< Supports External Memory 


Allocation */ 


/*!\brief Stream properties 


Hif 1 


* 


+ xoxo £ X 


/ 


This structure is used to query or set properties of the 
decoded stream. Algorithms may extend this structure with 
data specific to their bitstream by setting the sz member 
appropriately. 


typedef vpx codec stream info t vpx dec stream info t; 


#else 


typedef struct 


{ 


unsigned int sz; /**< Size of this structure */ 
unsigned int w; /**« Width (or 0 for unknown/default) */ 
unsigned int h; /**« Height (or 0 for unknown/default) */ 


unsigned int is kf; /**« Current frame is a keyframe */ 


) vpx dec stream info t; 
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Hendif 


/*!\brief Decoder interface structure. 
* 
* Contains function pointers and other data private to the 
* decoder implementation. This structure is opaque to the 
* application. 
EY 
typedef const struct vpx codec iface vpx dec iface t; 
typedef struct vpx codec priv  vpx dec priv t; 


/*!\brief Iterator 
* 


* Opaque storage used for iterating over lists. 
*/ 
typedef vpx codec iter t vpx dec iter t; 


/*!\brief Decoder context structure 


* 
* All decoders \ref MUST support this context structure fully. 
* In general, this data should be considered private to the 
* decoder algorithm, and not be manipulated or examined by the 
* calling application. Applications may reference the 'name' 
* member to get a printable description of the algorithm. 
*/ 
#if 1 
typedef vpx codec ctx t vpx dec ctx t; 
felse 


typedef struct 


const char "name; /**« Printable interface name */ 

vpx dec iface t *iface; /**« Interface pointers */ 

vpx dec err t err; /**« Last returned error */ 

vpx dec priv t *priv; /**< Algorithm private storage */ 
) vpx dec ctx t; 


Hendif 


/*!\brief Return the build configuration 
* 
* Returns a printable string containing an encoded version of 


* the build configuration. This may be useful to vpx support. 
* 


*/ 
const char *vpx dec build config(void) DEPRECATED; 
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'\brief Return the name for a given interface 


Returns a human readable string for name of the given decoder 
interface. 

Nparam[in] iface Interface pointer 

/ 


const char *vpx dec iface name( 


/ * 
* 


+ + AR Xo xo F 


* 


* 


vpx dec iface t *iface) DEPRECATED; 


!Nbrief Convert error number to printable string 


Returns a human readable string for the last error returned 
by the algorithm. The returned error will be one line and 
will not contain any newline characters. 

\param[in] err Error number. 

/ 


const char *vpx dec err to string(vpx dec err t err) DEPRECATED; 


/ * 
* 


+ + + + + X X ox 


'\brief Retrieve error synopsis for decoder context 
Returns a human readable string for the last error returned by 


the algorithm. The returned error will be one line and will 
not contain any newline characters. 


\param[in] ctx Pointer to this instance’s context. 


/ 


const char *vpx dec error(vpx dec ctx t *ctx) DEPRECATED; 


/* 


* 


+ RR HF X 
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'\brief Retrieve detailed error information for decoder context 


Returns a human readable string providing detailed information 
about the last error. 


\param[in] ctx Pointer to this instance’s context. 
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Vpx 


#define 


VpX | 


/*! 
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\retval NULL 
No detailed information is available. 


st char *vpx dec error detail(vpx dec ctx t *ctx) DEPRECATED; 


REQUIRED FUNCTIONS 


The following functions are required to be implemented for all 
decoders. They represent the base case functionality expected 
of all decoders. 


\brief Initialize a decoder instance 


Initializes a decoder context using the given interface. 
Applications should call the vpx dec init convenience macro 
instead of this function directly, to ensure that the ABI 
version number parameter is properly initialized. 


Nparam[in] ctx Pointer to this instance's context. 
\param[in] iface Pointer to the algorithm interface to use. 
\param[in] ver ABI version number. Must be set to 


VPX_DECODER_ABI_VERSION 
\retval #VPX_DEC_OK 
The decoder algorithm initialized. 
Nretval #VPX_DEC_MEM_ERROR 
Memory allocation failed. 


dec_err_t vpx_dec_init_ver ( 
vpx_dec_ctx_t KOEX; 

vpx dec iface t  "iface, 

int ver) DEPRECATED; 


vpx dec init(ctx, iface) \ 
dec init ver(ctx, iface, VPX DECODER ABI VERSION) 


\brief Destroy a decoder instance 


Destroys a decoder context, freeing any associated memory 
buffers. 


\param[in] ctx Pointer to this instance’s context 


\retval #VPX_DEC_OK 
The decoder algorithm initialized. 
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x Nretval £4$VPX DEC MEM ERROR 
X Memory allocation failed. 
El 
Vpx dec err t vpx dec destroy(vpx dec ctx t *ctx) DEPRECATED; 


/*!Nbrief Get the capabilities of an algorithm. 
* 


Retrieves the capabilities bitfield from the algorithm's 
interface. 


* 
* 
* Nparam[in] iface Pointer to the algorithm interface 
* 
Ef 


Vpx dec caps t vpx dec get caps( 
vpx dec iface t *iface) DEPRECATED; 


/*!\brief Parse stream info from a buffer 


* 
* Performs high level parsing of the bitstream. Construction of 
* a decoder context is not necessary. Can be used to determine 
* if the bitstream is of the proper format, and to extract 
* information from the stream. 
* 
* Nparam[in] iface Pointer to the algorithm interface 
* Nparam[in] data Pointer to a block of data to parse 
* Nparam[in] data sz Size of the data buffer 
x Nparam[in,out] si Pointer to stream info to update. The 
* size member \ref MUST be properly 
* initialized, but \ref MAY be 
* clobbered by the algorithm. This 
x parameter \ref MAY be NULL. 
* 
* Nretval 4VPX DEC OK 
ia Bitstream is parsable and stream information updated 
*/ 
vpx dec err t vpx dec peek stream info( 
vpx dec iface t *iface, 
const uint8 t *data, 
unsigned int data_sz, 


vpx dec stream info t *si) DEPRECATED; 
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/*!Nbrief Return information about the current stream. 


* Returns information about the stream that has been parsed 

* during decoding. 

* 

* Nparam[in] ctx Pointer to this instance's context 

x Nparam[in,out] si Pointer to stream info to update. 

* The size member \ref MUST be properly 

* initialized, but \ref MAY be clobbered 
a by the algorithm. This parameter \ref 
* MAY be NULL. 

* 

* Nretval 4VPX DEC OK 

ie Bitstream is parsable and stream information updated 

*/ 


vpx dec err t vpx dec get stream info( 


vpx dec ctx t KOER 
vpx_dec_stream_info_t *si) DEPRECATED; 


/*!\brief Control algorithm 


* 


+ + A 0X + 0o oo | F F F FF 0 FF 0 0 OX OR OR F Ro X 


This function is used to exchange algorithm-specific data with 
the decoder instance. This can be used to implement features 
Specific to a particular algorithm. 


This wrapper function dispatches the request to the helper 
function associated with the given ctrl id. It tries to call 
this function transparently, but will return 4VPX DEC ERROR if 
the request could not be dispatched. 


\param[in] ctx Pointer to this instance’s context 
Nparam[in] ctrl id Algorithm-specific control 
identifier 
Nparam[in,out] data Data to exchange with algorithm 
instance. 
\retval £4VPX DEC OK 
The control reguest was processed. 
\retval 4VPX DEC ERROR 
The control reguest was not processed. 
Xretval £VPX DEC INVALID PARAM 
The data was not valid. 
/ 


vpx dec err t vpx dec control(vpx dec ctx t  *ctx, 
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int ctrl id, 
void *data) DEPRECATED; 
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\brief Decode data 


Processes a buffer of coded data. If the processing results 
in a new decoded frame becoming available, 
#VPX_DEC_CB_PUT_SLICE and 4VPX DEC CB PUT FRAME events may be 
generated, as appropriate. Encoded data \ref MUST be passed 
in DTS (decode time stamp) order. Frames produced will always 
be in PTS (presentation time stamp) order. 


\param[in] ctx Pointer to this instance’s context 

\param[in] data Pointer to this block of new coded 
data. If NULL, a VPX_DEC_CB_PUT_FRAME 
event is posted for the previously 
decoded frame. 


\param[in] data_sz Size of the coded data, in bytes. 

\param[in] user_priv Application-specific data to associate 
with this frame. 

\param[in] rel_pts PTS relative to the previous frame, in 
us. If unknown or unavailable, set to 
zero. 


\return Returns #VPX_DEC_OK if the coded data was processed 
completely and future pictures can be decoded without 
error. Otherwise, see the descriptions of the other 
error codes in ::vpx_dec_err_t for recoverability 
capabilities. 


vpx_dec_err_t vpx_dec_decode ( 


Rai 
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vpx dec ctx t  *ctx, 


uint8 t data, 

unsigned int data sz, 

void user priv, 

int rel pts) DEPRECATED; 


\brief Decoded frames iterator 


Iterates over a list of the frames available for display. The 
iterator storage should be initialized to NULL to start the 
iteration.  Iteration is complete when this function returns 
NULL. 


The list of available frames becomes valid upon completion of 
the vpx dec decode call, and remains valid until the next call 


to vpx dec decode. 


\param[in] ctx Pointer to this instance’s context 
\param[in out] iter Iterator storage, initialized to NULL 
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Nreturn Returns a pointer to an image, if one is ready for 
display. Frames produced will always be in PTS 
(presentation time stamp) order. 


+ RA F > 


/ 
vpx image t *vpx dec get frame(vpx dec ctx t  *ctx, 
vpx dec iter t *iter) DEPRECATED; 


/*!Ndefgroup cap put frame Frame-Based Decoding Functions 


The following functions are required to be implemented for all 
decoders that advertise the VPX DEC CAP PUT FRAME capability. 
Calling these functions for codecs that don't advertise this 
capability will result in an error code being returned, 
usually VPX DEC ERROR 641 

/ 


+ + + + F F > 


/*!\brief put frame callback prototype 
* 


* This callback is invoked by the decoder to notify the 
* application of the availability of decoded image data. 
*/ 
typedef void (*vpx dec put frame cb fn t)( 
void *user priv, 
const vpx image t *img); 


/*!Nbrief Register for notification of frame completion. 


* 
* Registers a given function to be called when a decoded frame 
* is available. 
* 
* Nparam[in] ctx Pointer to this instance's context 
* Nparam[in] cb Pointer to the callback function 
* Nparam[in] user priv User's private data 
* 
x Nretval £$VPX DEC OK 
$ Callback successfully registered. 
x Nretval £VPX DEC ERROR 
ki Decoder context not initialized, or algorithm not capable 
X of posting slice completion. 
87 
vpx dec err t vpx dec register put frame cb( 
vpx dec ctx t *otx, 
vpx dec put frame cb fn t cb, 
void *user priv) DEPRECATED; 
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/*!@} - end defgroup cap put frame */ 
/*!Ndefgroup cap put slice Slice-Based Decoding Functions 


The following functions are required to be implemented for all 
decoders that advertise the VPX DEC CAP PUT SLICE capability. 
Calling these functions for codecs that don't advertise this 
capability will result in an error code being returned, 
usually VPX DEC ERROR 

@{ 

/ 


+ + + + S > 


/*!\brief put slice callback prototype 
* 


* This callback is invoked by the decoder to notify the 

* application of the availability of partially decoded image 

* data. 

E, 

typedef void (*vpx dec put slice cb fn t) (void *user priv, 

const vpx image t *img, 
const vpx image rect t *valid, 
const vpx image rect t *update); 


/*!\brief Register for notification of slice completion. 
* 


* Registers a given function to be called when a decoded slice 

* is available. 

* 

* Nparam[in] ctx Pointer to this instance's context 

* Nparam[in] cb Pointer to the callback function 

* Nparam[in] user priv User's private data 

* 

x Nretval £$VPX DEC OK 

E Callback successfully registered. 

x Nretval £VPX DEC ERROR 

i Decoder context not initialized, or algorithm not capable 

y of posting slice completion. 

E 

vpx_dec_err_t vpx_dec_register_put_slice_cb(vpx_dec_ctx_t ACEX, 

vpx dec put slice cb fn t cb, 
void *user priv) DEPRECATED; 


/*!@} - end defgroup cap put slice*/ 
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/*!Ndefgroup cap xma External Memory Allocation Functions 
* 
* The following functions are required to be implemented for all 
* decoders that advertise the VPX DEC CAP XMA capability. 
* Calling these functions for codecs that don't advertise this 
* capability will result in an error code being returned, 
* usually VPX DEC ERROR 
x @{ 
Ny 
/*!\brief Memory Map Entry 
* 
* This structure is used to contain the properties of a memory 
* segment. It is populated by the decoder in the request phase, 
* and by the calling application once the requested allocation 
* has been performed. 
$y 
#if 1 
#define VPX_DEC_MEM_ZERO 0x1 /**< Segment must be zeroed by 


allocation */ 


#define VPX_DEC_MEM_WRONLY 0x2 /**< Segment need not be 


readable */ 


#define VPX DEC MEM FAST 0x4 /**< Place in fast memory, if 


available */ 


typedef struct vpx codec mmap vpx dec mmap t; 


#else 


typedef struct vpx_dec_mmap 


{ 


/* 
* The following members are set by the codec when requesting 
* a segment 


el 

unsigned int id; /**« identifier for the segment's 
contents */ 

unsigned long sz; /**« size of the segment, in bytes */ 

unsigned int align;  /**« required alignment of the 
segment, in bytes */ 

unsigned int flags; /**< bitfield containing segment 
properties */ 

#define VPX DEC MEM ZERO Ox1 /**< Segment must be zeroed by 


allocation */ 


#define VPX DEC MEM WRONLY 0x2  /**« Segment need not be 


readable */ 


#define VPX DEC MEM FAST 0x4 /**< Place in fast memory, if 
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available */ 
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/* The following members are to be filled in by the 
* allocation function */ 


void *base; /**« pointer to the allocated 
segment */ 
void (*dtor) (struct vpx dec mmap *map);  /%%< destructor to 
call */ 
void *priv; /**« allocator private storage */ 


) vpx dec mmap t; 


#endif 


/*! 
* 
* 


+ xo X A A A X X Xo Xo Xo Xo xoxo x 


#define 
vpx 


/*! 
* 
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\brief Initialize a decoder instance in external allocation 
mode 


Initializes a decoder context using the given interface. 
Applications should call the vpx_dec_xma_init convenience 
macro instead of this function directly, to ensure that the 
ABI version number parameter is properly initialized. 


\param[in] ctx Pointer to this instance’s context. 

\param[in] iface Pointer to the algorithm interface to 
use. 

\param[in] ver ABI version number. Must be set to 


VPX_DECODER_ABI_VERSION 


\retval #VPX_DEC_OK 

The decoder algorithm initialized. 
\retval #VPX_DEC_ERROR 

Decoder does not support XMA mode. 


dec_err_t vpx_dec_xma_init_ver (vpx_dec_ctx_t *ctx, 
vpx dec iface t  *iface, 
int ver) DEPRECATED; 


vpx. dec xma init(ctx, iface) N 
dec xma init ver(ctx, iface, VPX DECODER ABI VERSION) 


\brief Iterate over the list of segments to allocate. 


Iterates over a list of the segments to allocate. The 
iterator storage should be initialized to NULL to start the 
iteration.  Iteration is complete when this function returns 
VPX DEC LIST END. The amount of memory needed to allocate is 
dependent upon the size of the encoded stream. This means 
that the stream info structure must be known at allocation 
time. It can be populated with the vpx dec peek stream info() 
function. In cases where the stream to be decoded is not 
available at allocation time, a fixed size must be requested. 
The decoder will not be able to decode streams larger than the 
Size used at allocation time. 
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\param[in] ctx Pointer to this instance’s context. 
Nparam[out] mmap Pointer to the memory map entry to 

populate. 
\param[in] si Pointer to the stream info. 
\param[in out] iter Iterator storage, initialized to NULL 


\retval #VPX_DEC_OK 
The memory map entry was populated. 
\retval #VPX_DEC_ERROR 
Decoder does not support XMA mode. 
\retval #VPX_DEC_MEM_ERROR 
Unable to determine segment size from stream info. 


Vpx dec err t vpx_dec_get_mem_map ( 
vpx dec ctx t *ctx, 
Vpx dec mmap t *mmap, 
const vpx dec stream info t  *si, 
vpx dec iter t *iter) DEPRECATED; 


/*! 


+ + A 0X 0X 0 Oo oo F F F FF 0 0 0 0 FF OR OR OR OR Ro X 


+ 
E 
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\brief Identify allocated segments to decoder instance 


Stores a list of allocated segments in the decoder.  Segments 
Xref MUST be passed in the order they are read from 

vpx. dec get. mem map(), but may be passed in groups of any 
size. Segments \ref MUST be set only once. The allocation 
function \ref MUST ensure that the vpx dec mmap t::base member 
is non-NULL. If the segment reguires cleanup handling (e.g., 
calling free() or close()) then the vpx dec mmap t::dtor 
member \ref MUST be populated. 


Nparam[in] ctx Pointer to this instance's context. 

Nparam[in] mmaps Pointer to the first memory map 
entry in the list. 

\param[in] num_maps Number of entries being set at this 
time 


\retval #VPX_DEC_OK 
The segment was stored in the decoder context. 
\retval #VPX_DEC_ERROR 
Decoder does not support XMA mode. 
Nretval #VPX_DEC_MEM ERROR 
Segment base address was not set, or segment was already 
stored. 
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Vpx dec err t vpx dec set mem map( 


vpx dec ctx t CEUX; 
vpx dec mmap t  *mmaps, 
unsigned int num maps) DEPRECATED; 


/*!@} — end defgroup cap xma*/ 
/*!@} - end defgroup decoder*/ 


fendif 

#ifdef _ cplusplus 

ii 

eee Rndccode block m5 Es oo Sse SaaS BA o o m6 N (o (rs sr 
20.22.  vpx image.c 


==> Begin code: block: — SaaS aS aaa a DIE sass 


/* 
* Copyright (c) 2010, 2011, Google Inc. All rights reserved. 

* 

* Use of this source code is governed by a BSD-style license 

* that can be found in the LICENSE file in the root of the source 
* tree. An additional intellectual property rights grant can be 

* found in the file PATENTS. All contributing project authors may 
* be found in the AUTHORS file in the root of the source tree. 

*/ 


finclude <stdlib.h> 
finclude <string.h> 
finclude "vpx/vpx image.h" 


static vpx image t "img alloc helper(vpx image t "img, 
vpx. img fmt t fmt, 
unsigned int d w, 
unsigned int d h, 
unsigned int  stride align, 
unsigned char *img data) 
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unsigned int h, w, S, xcs, ycs, bps; 
int align; 


/* Treat align--0 like align--1 */ 
if (!stride align) 
stride align < 1; 


/* Validate alignment (must be power of 2) */ 
if (stride align & (stride align - 1)) 
goto fail; 


/* Get sample size for this format */ 
Switch (fmt) 

( 

case VPX IMG FMT RGB32: 

case VPX IMG FMT RGB32 LE: 

case VPX IMG FMT ARGB: 


I 


case VPX IMG FMT ARGB LE: 
bps = 32; 
break; 
case VPX_IMG_FMT_RGB24: 
case VPX_IMG_FMT_BGR24: 
bps = 24; 
break; 


case VPX_IMG_FMT_RGB565: 
case VPX_IMG_FMT_RGB565_LE: 
case VPX_IMG_FMT_RGB555: 
case VPX_IMG_FMT_RGB555_LE: 
case VPX IMG FMT UYVY: 
case VPX IMG FMT YUY2: 
case VPX IMG FMT YVYU: 

bps = 16; 

break; 
case VPX IMG FMT I420: 
case VPX IMG FMT YV12: 
case VPX IMG FMT VPXI420: 
case VPX IMG FMT VPXYV12: 

bps < 12; 

break; 
default: 

bps = 16; 

break; 
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/* Get chroma shift values for this format */ 
switch (fmt) 
{ 
case VPX IMG FMT I420: 
case VPX IMG FMT YV12: 
case VPX IMG FMT VPXI420: 
case VPX IMG FMT VPXYV12: 
xcs = 1; 
break; 
default: 
xcs = 0; 
break; 


} 


switch (fmt) 
{ 
case VPX IMG FMT I420: 
case VPX IMG FMT YV12: 
case VPX IMG FMT VPXI420: 
case VPX IMG FMT VPXYV12: 
yes = 1; 
break; 
default: 
ycs = 0; 
break; 


} 


/* Calculate storage sizes given the chroma subsampling */ 


align = (1 << xcs) - 1; 

w = (d_w + align) & “align; 

align = (1 << ycs) - 1; 

h = (dh + align) s “align; 

s = (fmt & VPX IMG FMT PLANAR) ? w : bps * w / 8; 
s — (s t stride align - 1) & ^(stride align - 1); 


/* Allocate the new image */ 


if (!img) 
{ 
img = (vpx image t *)calloc(1, sizeof(vpx image t)); 
if (!img) 
goto fail; 


img->self allocd = 1; 
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else 


{ 
memset (img, 0, sizeof(vpx image t)); 


} 
img->img_data = img_data; 


if (!img_data) 
{ 
img->img_data = malloc((fmt & VPX_IMG_FMT_PLANAR) 
h * w * bps / 8 : h " s); 
img->img data owner = 1; 


} 


if (!img->img_data) 
goto fail; 


img->fmt = fmt; 


img->w = w; 
img->h = h; 
img->x_chroma_shift = xcs; 
img->y_chroma_shift = ycs; 


img->bps = bps; 


/* Calculate strides */ 


P 


img->stride[VPX PLANE Y] = img->stride[VPX PLANE ALPHA] = s; 


img->stride[VPX PLANE U] = img->stride[VPX PLANE V] = 


/* Default viewport to entire image */ 
if (!vpx img set rect(img, 0, 0, dw, d h)) 
return img; 


fail: 
vpx img free(img); 
return NULL; 

} 


vpx image t *vpx img alloc(vpx image t  *img, 
vpx img fmt t fmt, 
unsigned int d w, 
unsigned int d h, 
unsigned int  stride align) 


s >> XCS; 


return img alloc helper(img, fmt, d w, d h, stride align, NULL); 
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vpx image t *vpx img wrap(vpx image t  *img, 


vpx img fmt t fmt, 

unsigned int d w, 

unsigned int d h, 

unsigned int  stride align, 
unsigned char *img data) 


return img alloc helper(img, fmt, d w, d h, stride align, 


} 


img data); 


int vpx img set rect(vpx image t "img, 


unsigned int x, 
unsigned int y, 
unsigned int w, 
unsigned int h) 


unsigned char *data; 


if 


{ 
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(x + w <= img->w && y + h <= img->h) 


img->d_w = w; 
img->d_h = h; 


/* Calculate plane pointers */ 
if (!(img->fmt & VPX_IMG_FMT_PLANAR) ) 
{ 
img->planes[VPX PLANE PACKED] = 
img->img data + x * img->bps / 8 + y x 
img->stride[VPX PLANE PACKED]; 


else 
data = img->img data; 


if (img->fmt & VPX IMG FMT HAS ALPHA) 


img->planes[VPX PLANE ALPHA] < 
data t x t y % img->stride[VPX PLANE ALPHA]; 
data += img->h * img->stride[VPX PLANE ALPHA]; 
} 


img->planes[VPX PLANE Y] = 


data t x t y % img->stride[VPX PLANE Y]; 
data += img->h * img->stride[VPX PLANE Y]; 
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if (!(img-»fmt € VPX IMG FMT UV FLIP)) 
{ 


img->planes[VPX PLANE U] = data 
+ (x >> img-»x chroma shift) 
+ (y >> img->y chroma shift) * 
img->stride[VPX PLANE U]; 
data += (img->h >> img->y chroma shift) * 
img->stride[VPX PLANE U]; 
img->planes[VPX PLANE V] = data 
+ (x >> img->x chroma shift) 
+ (y >> img->y chroma shift) * 
img->stride[VPX PLANE V]; 
} 
else 
{ 
img->planes[VPX PLANE V] = data 
+ (x >> img-»x chroma shift) 
+ (y >> img->y chroma shift) * 
img->stride[VPX PLANE V]; 
data += (img->h >> img->y chroma shift) * 
img->stride[VPX PLANE V]; 
img->planes[VPX PLANE U] = data 
+ (x >> img->x chroma shift) 
+ (y >> img->y chroma shift) * 
img->stride[VPX PLANE U]; 


} 


return 0; 


} 


return -1; 


} 


void vpx_img_flip(vpx_image_t *img) 
{ 

/* Note: In the calculation pointer adjustment calculation, we 
want the rhs to be promoted to a signed type. Section 6.3.1.8 
of the ISO C99 standard [ISO-C99] indicates that if the 
adjustment parameter is unsigned, the stride parameter will be 
promoted to unsigned, causing errors when the lhs is a larger 
type than the rhs. 

/ 
img->planes[VPX PLANE Y] += (signed) 

(img->d h - 1) * img->stride[VPX PLANE Y]; 
img->stride[VPX PLANE Y] = -img->stride[VPX PLANE Y]; 
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img->planes[VPX PLANE U] += (signed) 


((img-»d h >> img->y chroma shift) - 1) 
* img->stride[VPX PLANE U]; 
img->stride[VPX PLANE U] = -img->stride[VPX PLANE U]; 


img->planes[VPX PLANE V] t< (signed) 

((img->d h >> img->y chroma shift) - 1) * 
img->stride[VPX PLANE V]; 
img->stride[VPX PLANE V] = -img->stride[VPX PLANE V]; 


img->planes[VPX PLANE ALPHA] t< (signed) 
(img->d h - 1) * img->stride[VPX PLANE ALPHA]; 
img->stride[VPX PLANE ALPHA] = -img->stride[VPX PLANE ALPHA]; 


} 


void vpx_img_free(vpx_image_t *img) 
{ 
if (img) 
{ 
if (img->img data && img->img data owner) 
free (img->img data); 


if (img->self allocd) 
free(img); 


} 

=== End code: block, SaaS s SS Ses Se SSR SS SS SSS 
20.23.  vpx image.h 

roba Beginacode blOočki StS tee EI SS SS SSeS DB > Z 


/ 
Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 

/ 


+ + + + AR Xo F Xo F 


/*!\file 
* \brief Describes the vpx image descriptor and associated 


* operations 
* 


ši 
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#ifdef _ cplusplus 


extern 
Hendif 


"E " { 


#ifndef VPX_IMAGE_H 
#define VPX_IMAGE_H 


/*!Nbrief Current ABI version number 


+ xoxo AR ER O* > 


/ 


#define VPX IMAGE ABI VERSION 


\internal 
If this file is altered in any way that changes the ABI, 
value must be bumped. 
to, 


changi 


ng types, 


#define VPX_IMG_FMT_PLANAR 


#define VPX_IMG_FMT_UV_FLIP 


#define VPX IMG FMT HAS ALPHA 


/*!\brief List of supported image 


(1) 


0x100 


0x200 


0x400 


typedef enum vpx img fmt { 
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Examples include, 
removing or reassigning enums, 
adding/removing/rearranging fields to structures 


/**«Nhideinitializer*/ 


November 2011 


this 


but are not limited 


/**« Image is a planar 


format */ 


/**« V plane precedes U plane 


in memory */ 


/**« Image has an alpha channel 


component */ 


formats */ 


24 bit per pixel packed RGB */ 
32 bit per pixel packed ORGB */ 
16 bit per pixel, 565 */ 

16 bit per pixel, 555 */ 


UYVY packed YUV */ 
YUYV packed YUV */ 
YVYU packed YUV */ 
24 bit 
32 bit 
32 bit packed ARGB, 
32 bit packed BGRA, 
16 bit per pixel, 
gggbbbbb rrrrrggg */ 
16 bit per pixel, 
gggbbbbb Orrrrrgg */ 


— VPX IMG FMT PLANAR | 


FMT UV FLIP | 1, /**« planar YVU */ 


— VPX IMG FMT PLANAR | 2, 


VPX IMG FMT NONE, 
VPX IMG FMT RGB24, /**« 

VPX IMG FMT RGB32, /**« 

VPX IMG FMT RGB565, /**« 

VPX IMGFMT RGB555, /**« 

VPX IMG FMT UYVY, /**« 

VPX IMG FMT YUY2, /**« 

VPX IMG FMT YVYU, /**« 

VPX IMG FMT BGR24, /**« 

VPX IMG FMT RGB32 LE, /**« 

VPX IMG FMT ARGB, /**« 

VPX IMG FMT ARGB LE, /**« 

VPX IMG FMT RGB565 LE,  /**« 

VPX IMG FMT RGB555 LE,  /**« 

VPX IMG FMT YV12 

VPX IMG 
VPX IMG FMT I420 
et al. Informational 


per pixel packed BGR */ 
packed BGRO */ 

alpha-255 */ 
alpha-255 */ 
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VPX IMG FMT VPXYV12 = VPX IMG FMT PLANAR | 
VPX_IMG_FMT_UV_FLIP | 3, /** < planar 4:2:0 format with 
vpx color space */ 
VPX IMG FMI VPXI420 = VPX IMG FMT PLANAR | 4 /** < planar 
4:2:0 format with vpx color space */ 


} 


vpx_img_fmt_t; /**< alias for enum vpx_img_fmt */ 


#if !defined(VPX CODEC DISABLE COMPAT) || !VPX CODEC DISABLE COMPAT 
/** \deprecated Use 4VPX IMG FMT PLANAR */ 

#define IMG FMT PLANAR VPX IMG FMT PLANAR 

/** Ndeprecated Use 4VPX IMG FMT UV FLIP */ 

#define IMG FMT UV FLIP VPX IMG FMT UV FLIP 

/** \deprecated Use 4£VPX IMG FMT HAS ALPHA */ 

#define IMG FMT HAS ALPHA VPX IMG FMT HAS ALPHA 


/*!\brief Deprecated list of supported image formats 
x Ndeprecated New code should use fvpx img fmt 
*/ 
#define img fmt vpx img fmt 
/*!Nbrief alias for enum img fmt. 
x Ndeprecated New code should use 4vpx img fmt t 
E 
fdefine img fmt t vpx img fmt t 


/** Ndeprecated Use 4VPX IMG FMT NONE */ 
#define IMG FMT NONE VPX IMG FMT. NONE 
/** Ndeprecated Use 4VPX IMG FMT RGB24 */ 
#define IMG FMT RGB24 VPX IMG FMT. RGB24 
/** Ndeprecated Use 4VPX IMG FMT RGB32 */ 
#define IMG FMT RGB32 VPX IMG FMT. RGB32 
/** \deprecated Use 4VPX IMG FMT RGB565 */ 
#define IMG FMT RGB565 VPX IMG FMT. RGB565 
/** \deprecated Use 4VPX IMG FMT RGB555 */ 
#define IMG FMT RGB555 VPX IMG FMT. RGB555 
/** \deprecated Use 4VPX IMG FMT UYVY */ 
#define IMG FMT UYVY VPX IMG FMT UYVY 
/** \deprecated Use £4VPX IMG FMT YUY2 */ 
#define IMG FMI YUY2 VPX IMG FMT YUY2 
/** Ndeprecated Use 4VPX IMG FMT YVYU */ 
define IMG FMT YVYU VPX IMG FMT. YVYU 
/** Ndeprecated Use 4VPX IMG FMT BGR24 */ 
define IMG FMT BGR24 VPX IMG FMT. BGR24 
/**« \deprecated Use 4VPX IMG FMT RGB32 LE */ 
define IMG FMT RGB32 LE VPX IMG FMT RGB32 LE 
/** Ndeprecated Use 4VPX IMG FMT ARGB */ 
#define IMG FMT ARGB VPX IMG FMT ARGB 


I 


I 


j 


T 


I 


I 
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/** Ndeprecated Use £4VPX IMG FMT ARGB LE */ 


#define IMG FMT ARGB LE 


VPX IMG FMT ARGB LE 


/** \deprecated Use £4VPX IMG FMT 


#define IMG FMT RGB565 LE 


/** \deprecated Use 4VPX IMG FMT 
VPX IMG FMT RGB555 LE 


#define IMG FMT RGB555 LE 


/** \deprecated Use 4VPX IMG FMT 


#define IMG FMT YV12 


/** Ndeprecated Use #VPX_IMG_FMT_ 
VPX IMG FMT I420 


#define IMG FMT I420 


/** \deprecated Use 4VPX IMG FMT 


#define IMG FMT VPXYV12 


/** Ndeprecated Use 4VPX IMG FMT 
VPX IMG FMT VPXI420 
#endif /* VPX CODEC DISABLE COMPAT */ 


#define IMG FMT VPXI420 


/**Nbrief Image Descriptor */ 


typedef struct vpx im 


{ 
vpx_img_fmt_t fmt 


/* Image storage 
unsigned int wy; 
unsigned int h; 


/* Image display 
unsigned int d w 
unsigned int d h 


age 


VPX IMG F 


RGB565 LE */ 


VPX IMG FMT RGB565. LE 


RGB555 LE */ 


YV12 */ 
1420 */ 


VPXYV12 */ 


VPX IMG FMT VPXYV12 
' VPXI420 */ 


; /**« Image Format */ 


dimensions */ 
/**« Stored image width */ 
/**« Stored image height */ 


dimensions */ 
; /**< Displayed image width */ 
; /**< Displayed image height */ 


/* Chroma subsampling info */ 
unsigned int x chroma shift; /**« subsampling order, X */ 
unsigned int y chroma shift; /**« subsampling order, Y */ 


/* Image data poi 
#define VPX PLANE PACKED 
#define VPX PLANE Y 
#define VPX PLANE U 
#define VPX PLANE V 
#define VPX PLANE ALPHA 
#if !defined(VPX CODEC DI 
#define PLANE PACKED 
#define PLANE Y 
#define PLANE U 
#define PLANE V 
#define PLANE ALPHA 
fendif 
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nters. 


0 /**« 
0 /**« 
1 /**« 
2 /**« 
3 /**« 


ai 


To be used for all packed formats %/ 
Y (Luminance) plane %/ 

U (Chroma) plane %/ 

V (Chroma) plane */ 

A (Transparency) plane %/ 


SABLE COMPAT) | | !VPX CODEC DISABLE COMPAT 
VPX PLANE PACKED 

VPX PLANE Y 

VPX PLANE U 

VPX PLANE V 

VPX PLANE ALPHA 
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unsigned char "planes[4];  /**« pointer to the top-left pixel 
q for each plane */ 

int stride[4]; /**< stride between rows for each plane %/ 
int bps; /**« bits per sample (for packed formats) */ 


/* The following member may be set by the application to 
* associate data with this image. 
av 
void *user priv; /**< may be set by the application to 
associate data with this image. */ 


/* The following members should be treated as private. */ 


unsigned char *img data; /**« private */ 
int img data owner; /**« private */ 
int self allocd; /**« private */ 


) vpx image t; /**« alias for struct vpx image */ 


/**\brief Representation of a rectangle on a surface */ 
typedef struct vpx image rect 


{ 


unsigned int x; /**< leftmost column */ 
unsigned int y; /**< topmost row */ 
unsigned int w; /**< width */ 

unsigned int h; /**< height */ 


} vpx_image_rect_t; /**< alias for struct vpx_image_rect */ 


/*!\brief Open a descriptor, allocating storage for the 


+ + + + oko + ooo CAC ACA FF F F OR Ro F X 
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underlying image 


Returns a descriptor for storing an image of the given format. 
The storage for the descriptor is allocated on the heap. 


\param[in] img Pointer to storage for descriptor. 
If this parameter is NULL, the storage 
for the descriptor will be allocated 
on the heap. 


\param[in] fmt Format for the image 

\param[in] d_w Width of the image 

\param[in] d_h Height of the image 

\param[in] align Alignment, in bytes, of each row in 


the image. 
\return Returns a pointer to the initialized image descriptor. 


If the img parameter is non-null, the value of the img 
parameter will be returned. 
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vpx image t *vpx img alloc(vpx image t  *img, 
vpx img fmt t fmt, 
unsigned int d w, 
unsigned int d h, 
unsigned int align); 


/*!\brief Open a descriptor, using existing storage for the 
* underlying image 


* 

* Returns a descriptor for storing an image of the given format. 
* The storage for descriptor has been allocated elsewhere, and a 
* descriptor is desired to "wrap" that storage. 

* 

* Nparam[in] img Pointer to storage for descriptor. 

* If this parameter is NULL, the storage 
x for the descriptor will be 

* allocated on the heap. 

* Nparam[in] fmt Format for the image 

* Nparam[in] d w Width of the image 

x Nparam[in] d h Height of the image 

x Nparam[in] align Alignment, in bytes, of each row in 

X the image. 

* Nparam[in] img data Storage to use for the image 

* 

* Nreturn Returns a pointer to the initialized image descriptor. 
A If the img parameter is non-null, the value of the img 
x parameter will be returned. 

ai 


vpx image t *vpx img wrap(vpx image t  *img, 
vpx img fmt t fmt, 
unsigned int d w, 
unsigned int d h, 
unsigned int align, 
unsigned char *img data); 


/*!Nbrief Set the rectangle identifying the displayed portion of 


* the image 

* 

* Updates the displayed rectangle (aka viewport) on the image 
* surface to match the specified coordinates and size. 
* 

* Nparam[in] img Image descriptor 

* Nparam[in] x leftmost column 

x Nparam[in] y topmost row 

x Nparam[in] wW width 

* \param[in] h height 

* 


Bankoski, et al. Informational [Page 296] 


RFC 6386 VP8 Data Format and Decoding Guide November 2011 


* Nreturn 0 if the requested rectangle is valid, non-zero 
* otherwise. 
El 
int vpx img set rect(vpx image t  *img, 
unsigned int x, 
unsigned int y, 
unsigned int w, 
unsigned int h); 


/*!\brief Flip the image vertically (top for bottom) 


* Adjusts the image descriptor's pointers and strides to make 
* the image be referenced upside-down. 

* 

* Nparam[in] img Image descriptor 

%/ 


void vpx img flip(vpx image t "img); 


/*!Nbrief Close an image descriptor 


* 

* Frees all allocated storage associated with an image 
* descriptor. 

* 

* Nparam[in] img Image descriptor 

x 


void vpx img free(vpx image t *img); 


fendif 
#ifdef _ cplusplus 


) 
dendif 


255 Eno coge block. ==> Sa RSS Seas 
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20.24. 


vpx integer.h 


smmm Begin code block === 220932 2 SH HS 32 


/ 


+ + Xo £ X ER F O F 


/ 


Copyright (c) 2010, 2011, Google Inc. All rights reserved. 


Use of this source code is governed by a BSD-style license 

that can be found in the LICENSE file in the root of the source 
tree. An additional intellectual property rights grant can be 
found in the file PATENTS. All contributing project authors may 
be found in the AUTHORS file in the root of the source tree. 


#ifndef VPX INTEGER H 
#define VPX INTEGER H 


/* get ptrdiff t, size t, wchar t, NULL */ 
#include <stddef.h> 


#if defined( MSC VER) || defined(VPX EMULATE INTTYPES) 
typedef signed char  int8 t; 

typedef signed short intl6 t; 

typedef signed int int32 t; 

typedef unsigned char  uint8 t; 

typedef unsigned short uintl6 t; 

typedef unsigned int uint32 t; 


#if defined( MSC VER) 

typedef signed __int64 int64 t; 
typedef unsigned _ int64 uint64 t; 
#define PRId64 "I64d" 

fendif 


#ifdef HAVE ARMV6 

typedef unsigned int int fastl6 t; 
felse 

typedef signed short int fastl6 t; 
#endif 

typedef signed char int fast8 t; 
typedef unsigned char uint fast8 t; 


#ifndef | UINTPTR T DEFINED 
typedef unsigned int uintptr t; 
fendif 
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felse 

/ Most platforms have the C99 standard integer types. %/ 

#if defined(__cplusplus) && !defined(  STDC FORMAT MACROS) 

#define STDC FORMAT. MACROS 

#endif 

#include <stdint.h> 

#include <inttypes.h> 

#endif 

#endif 

ees End code BLOCK 55555 —SSeSsS 2259902959222 HH 
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.26. 
Copyright 


Redistribution and use in source and binary forms, 


modification, are permitted provided that the following conditions 


LICENSE 


(c) 2010, 2011, Google Inc. 


are met: 


o 


Redistributions of source code must 
notice, this list of conditions and 


Redistributions in binary form must 
notice, this list of conditions and 
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All rights reserved. 


with or without 


retain the above copyright 
the following disclaimer. 


reproduce the above copyright 
the following disclaimer in 


the documentation and/or other materials provided with the 


distribution. 


Neither the name of Google nor the names of its contributors may 
be used to endorse or promote products derived from this software 


without specific prior written perm 


ission. 


THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS 


"AS IS" 
LIMITED TO, 


A PARTICULAR PURPOSE ARE DISCLAIMED. 


HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, 
INCIDENTAL, 
BUT NOT LIMITED TO, 
OF USE, 
AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, 
(INCLUDING NEGLIGENCE OR OTHERWISE) 
EVEN IF ADVISED OF THE 


LIABILITY, OR TORT 
WAY OUT OF THE USE OF THIS SOFTWARE, 


SPECIAL, EXEMPLARY, OR CON 
PROCUREMENT OF SUB 


DATA, OR PROFITS; OR BUSINESS 


POSSIBILITY OF SUCH DAMAGE. 


et al. Information 


AND ANY EXPRESS OR IMPLIED WARRANTIES, 
THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR 


INCLUDING, BUT NOT 


IN NO EVENT SHALL THE COPYRIGHT 
INDIRECT, 

SEQUENTIAL DAMAGES (INCLUDING, 
STITUTE GOODS OR SERVICES; LOSS 
INTERRUPTION) HOWEVER CAUSED 
STRICT 

ARISING IN ANY 
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20.27. PATENTS 
Additional IP Rights Grant (Patents) 


"This implementation" means the copyrightable works distributed by 
Google as part of the WebM Project. 


Google hereby grants to you a perpetual, worldwide, non-exclusive, 
no-charge, royalty-free, irrevocable (except as stated in this 
section) patent license to make, have made, use, offer to sell, sell, 
import, transfer, and otherwise run, modify and propagate the 
contents of this implementation of VP8, where such license applies 
only to those patent claims, both currently owned by Google and 
acquired in the future, licensable by Google that are necessarily 
infringed by this implementation of VP8. This grant does not include 
claims that would be infringed only as a consequence of further 
modification of this implementation. If you or your agent or 
exclusive licensee institute or order or agree to the institution of 
patent litigation against any entity (including a cross-claim or 
counterclaim in a lawsuit) alleging that this implementation of VP8 
or any code incorporated within this implementation of VP8 
constitutes direct or contributory patent infringement, or inducement 
of patent infringement, then any patent rights granted to you under 
this License for this implementation of VP8 shall terminate as of the 
date such litigation is filed. 


21. Security Considerations 


A VP8 decoder should take appropriate security considerations into 
account, as outlined in [RFC4732] and [RFC3552]. It is extremely 
important that a decoder be robust against malicious payloads. 
Malicious payloads must not cause the decoder to overrun its 
allocated memory or to consume inordinate resources. Although 
encoder issues are typically rarer, the same applies to an encoder. 
Malicious stream data must not cause the encoder to misbehave, as 
this might allow an attacker access to transcoding gateways. 
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